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A SURVEY OF MAX-TYPE RECURSIVE DISTRIBUTIONAL 

EQUATIONS 
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University of California, Berkeley and University of Minnesota 

In certain problems in a variety of applied probability settings 
(from probabilistic analysis of algorithms to statistical physics), the 
central requirement is to solve a recursive distributional equation of 
the form X = g{{^i, Xi),i > 1). Here (^i) and g{-) are given and the 
Xi are independent copies of the unknown distribution X. We survey 
this area, emphasizing examples where the function g{-) is essentially 
a "maximum" or "minimum" function. We draw attention to the the- 
oretical question of endogeny: in the associated recursive tree process 
Xi, are the Xi measurable functions of the innovations process (5i)? 

1. Introduction. Write V for the space of probability distributions on a 
space S; in our examples, S will usually be M or a subset of M. Suppose we 
are given a joint distribution for some family of random variables {£,i,i > 1), 
and given an S- valued function g{-) with appropriate domain (Section 2.1 
gives this setup more carefully) . Then we can define a map T -.V ^ V as 
follows: 

T(/x) is the distribution of g{{£,i, Xi),i > 1), where the (Xi) are 
independent with distribution ji, independent of the family (^j). 

Within this general framework one can ask about existence and uniqueness of 
fixed points, that is, distributions fj, such that T(//) = n, and about domain of 
attraction for a fixed point ^, that is, for what initial u do we have T'^{v) fj, 
as n ^ oo. One can rewrite such a fixed point equation in terms of random 
variables as 
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where the independence property is assumed imphcitly. We introduce the 
phrase recursive distributional equation (RDE) for equations of format (1), 
as opposed to alternate kinds of fixed point equation. RDEs have arisen in 
a variety of settings: 

• Galton-Watson branching processes and related random trees, 

• probabilistic analysis of algorithms with suitable recursive structure, 

• statistical physics models on trees, 

• statistical physics and algorithmic questions in the mean-field model of 
distance. 

Three aspects of this topic have been well studied. Perhaps the best known 
fixed point equation is 

(2) X = 2-^^^{Xi+X2) {S = R) 

whose solutions are the Normal (0, cr^) family. This example extends to give 
characterizations of stable distributions [63]. Moreover, there is a classical 
topic "characterization of probability distributions" [43] which considers the 
named families of distributions in mathematical statistics and studies many 
different types of characterization, some of which can be put into form (1). 
But this aspect is rather remote from our concerns. A second aspect concerns 
general methods for establishing existence or uniqueness of fixed points. Two 
natural methods (monotonicity; metric contraction) are recalled in Section 
2.2, while the more elaborate method of "a.s. unique tree representations" 
or "tree-structured coupling from the past" is described in Section 2.6. The 
third aspect is the linear subcase g{{S,i, Xi),i > 1) = J2i ^^id its variants, 
which we review in Section 3. This is well understood for 5 = M^, though 
not so well understood for S = M. 

The main purpose of this survey article is to illustrate the variety of 
contexts where RDEs have arisen, and to draw attention to another subclass 
of RDEs, those involving max-type functions g. We present in Sections 4-7 
a collection of around ten examples (summarized in Table 1) of max-type 
RDEs arising from concrete questions. Most have been studied in detail 
elsewhere; in each case we seek to outline the underlying problem, describe 
how it leads to an RDE, and give information about solutions of general 
or special cases. Section 8 provides brief remarks on Monte Carlo methods, 
process analogs and continuous-time analogs, and lists the numerous open 
problems and conjectures. 

On the theoretical side, in Section 2.3 we set out carefully some general 
theory, rather natural from the statistical physics or interacting particle 
system viewpoint but less apparent from the algorithms viewpoint, which 
relates RDEs to a type of tree- indexed process (X^) which we call recursive 
tree processes (RTFs). In particular we introduce the endogenous property 
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Table 1 

Some max-type RDEs. Functions g{-) for which the RDE X — g{{£,i,Xi),i > 1) are discussed* 



Section 


<?(■) 


Underlying model 


Endog? 


Comments 


4.2 


S = R+ 
maxi{Xi 


Range of BRW 


Yes 






4.3 


mmi(Xi 


Algorithm for BRW range 


Yes 






4.6 


maxi(^i — Xi)^ 


Matching on GW tree 


Yes 






4.4 


^0 + maxi(5iXi) 


Discounted tree sums 


Yes 




^0 = reduces to 












BRW extremes 


4.4 


^0 + mmi{£,iXi) 


Discounted tree sums 


Yes 




See (49) 


4.6 


(eo-E,^o+ 


Independent subset GW tree 


Yes 






7.2 




Percolation of MSTs 


Yes 




Determines critical c 


7.6 


See (98) 


First passage percolation 


Conj. 


Y 


Mean-field scaling analysis 


5 


5 = R 

c + maxi(Xi 


Extremes in BRW 


No 




c specified by dist(^i) 


7.3 


mmi(5i - Xi) 


Mean-field minimal matching 


Yes 






7.4 


minfl (Ci-X,) 


Mean-field TSP 


Conj. 


Y 


min'^' denotes 












second smallest 




Other S 










6 


$(mm(Xi,X2),^o) 


Frozen percolation on tree 


Yes 




$ defined in Section 6 


7.6 


See (96), (97), (98) Mean-field scaling 


Conj. 


Y 


5 = or 



*Note =max(x,0). For S = R a "max" problem is equivalent to a "min" problem by trans- 
forming X to —X, but for S = R"^ this does not work: the problems in Sections 4.2 and 4.3 are 
different. Typically the (^i) are either i.i.d. or are the successive points of a Poisson process on 
(0, oo). "Endogenous" refers to fundamental solution. Key to acronyms; BRW, branching random 
walk; GW, Galton- Watson; MST, minimal spanning tree; TSP, traveling salesman problem. 



(Definition 7), that in an RTP Xi is a measurable function of the driving 
tree-indexed process (^i) without any external randomness being needed, 
and show (Theorem 11) that endogeny is equivalent to a bivariate uniqueness 
property. 

A concluding Section 9 will attempt to review the big picture. 

1.1. Three uses of RDEs. When we look at how RDEs arise within spe- 
cific models in the Table 1 examples, we will see three broad categories of 
use, which seem worth mentioning at the start. 

1.1.1. Direct use. Here is the prototype example of direct use, where the 
original question asks about a random variable X and the distribution of X 
itself satisfies an RDE. 

Example 1. Let X be the total population in a Galton- Watson branch- 
ing process where the number of offspring is distributed as ^. In the case 
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E,^ < 1 [and = 1) 7^ 1] it is well known that X < 00 a.s., and then easy 
to check that dist(X) is the unique solution of the RDE 

X = l + Y,Xi (5 = Z+). 

1=1 

We will see other direct uses in Proposition 25 and in the examples in 
Section 4.4. 

1.1.2. Indirect use. The simplest kind of indirect use is where the quan- 
tity of interest can be written in terms of known quantities and some other 
quantity which can be analyzed via an RDE. See Proposition 28 and The- 
orem 41 for results of this kind. But there is a more intriguing kind of 
indirect use which we call a 540° argument, exemplified in the frozen per- 
colation model of Section 6 and also used in the mean-field combinatorial 
optimization problems in Sections 7.3-7.6. In these examples we start with 
a heuristically defined quantity X, and a heuristic argument that it should 
satisfy an RDE. Next we make a rigorous argument by first solving the RDE 
and then using the associated, rigorously defined RTP as building blocks for 
a rigorous construction. 

1.1.3. Critical points and scaling laws. We introduce this idea with an 
artificial example. 

Example 2. Let ^ be M-valued, = f3, and let > 1) be indepen- 
dent copies of ^. For fixed c G M consider the RDE 

(3) X=max(0,X + e-c) (5 = M+). 

Then there is a solution Xc on M"*" if and only if c > /3. Moreover, if var(^) G 
(0, 00), then 

(4) ^^-2?^ -^i"- 

Here (3) is a Lindley equation from classical queuing theory [12], and it is 
straightforward that for c> (5 the solution is 

d ^ 

(5) Xc=max^(^i-c). 

i=l 

This Xc is a.s. finite by the strong law of large numbers, and the stated 
asymptotics (4) follow from, for example, weak convergence of random walks 
to Brownian motion with drift. 
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We will see later three examples of problems involving critical values or 
near-critical behavior of some random system. In such problems there is 
a parameter c and we are interested in a criticcil va-luG Ccrit 

of c defined 

as the value where some "phase transition" occurs, or in behavior of the 
system for c near Ccrit- In Section 7.2 we see an example where the critical 
point is determined as the boundary between existence and nonexistence of 
a solution to an RDE (Proposition 56). In Section 4.3 we see how aspects of 
near-critical behavior may be reduced to study of near-critical solutions of an 
RDE (Theorem 29), and Section 7.6 contains more sophisticated variations 
on that theme. Note that in Example 2, result (4) shows that the behavior of 
solutions near the critical point scales in a simple way that does not depend 
on the details of the distribution of ^; according to the statistical physics 
paradigm of universality one should expect such scaling laws to arise in most 
natural problems. 

1.2. The cavity method. One particular topic of current interest concerns 
the cavity method in statistical physics, applied in the context of combinato- 
rial optimization in mean- field settings. There is a methodology for seeking 
rigorous proofs, in which the central issue becomes uniqueness of solution of 
some problem-dependent RDE. We will elaborate slightly in Section 7.5. 

2. The general setting. 

2.1. A precise setup. Here we record a more careful setup for RDEs. Let 
(5,5) be a measurable space, and let V{S) be the set of probability measures 
on {S,S). Let (6,T) be another measurable space. Construct 

e*:=ex y s"^, 

0<m<oo 

where the union is a disjoint union and where S"" is product space, inter- 
preting 5°° as the usual infinite product space and as a singleton set, 
which we will write as {A}. Let g:Q* ^ S he measurable. Let be a prob- 
ability measure on G x Z+, where Z+ := {0, 1,2,...; oo}. These objects can 
now be used to define a measurable map T:V{S) -^V{S) as follows. Write 
<* to mean < N for N < oo and to mean < oo for N = oo. 

Definition 3. r(^) is the distribution of g{S,,Xi, 1 <i<* N), where: 

(i) {Xi,i > 1) are independent with distribution /x; 

(ii) (^, N) has distribution u; 

(iii) the families in (i) and (ii) are independent. 
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Equation (1) fits this setting by writing C = (Ci). In most examples there 
is a sequence (^j), but for theoretical discussion we regard such a sequence 
as a single random element 

In examples where P{N = oo) > a complication often arises. It may be 
that g{-) is not well defined on all of 6 x 5°°, although g{^,Xi, 1 < i <* N) is 
well defined almost surely for (Xj)j>i i.i.d. with distribution in a restricted 
class of probabilities on S. For such examples and also for other cases where 
it is natural to restrict attention to distributions satisfying some conditions 
(like moment conditions), we allow the more general setting where we are 
given a subset V C V{S) such that g{^, Xi,l <i <* N) is well defined almost 
surely for i.i.d. {Xi)^^^ with distribution in V- Now T is well defined as a 
map 

(6) T:V^V{S). 

In this extended case it is natural to seek, but maybe hard to find, a subset 
V such that T maps V into V . 

2.2. Monotonicity and contraction. There are standard tools for study- 
ing maps T : V{S) ^{S) which do not depend on the map arising in the 
particular way of Definition 3. First suppose S" C M is an interval of the form 
[0,a;o] for some xq < oo, oi S = [0,oo). Consider the usual stochastic partial 
order ^ on V{S): 

fii ^ 1-12 iff /Ui[0, x] > ^2[0,x], xeS. 

Say T is monotone if 

fii ^ 1-12 implies r(/_fi) ^ T(^2)- 

Note that, writing for the probability measure degenerate at 0, if T is 
monotone, then the sequence of iterates T"Jo is increasing, and then the 
limit 

limT^Jo = 

n 

exists in the sense of weak convergence on the compactified interval [0, oo] . 

Lemma 4 (Monotonicity lemma). Let S be an interval as above. Suppose 
T is monotone. If fi^ gives nonzero measure to {oo}, then T has no fixed 
point on V{S). If fi^, gives zero measure to {oo}, and ifT is continuous with 
respect to increasing limits j /Uqo implies T(/_f„) | T(^oo)], then fi^: is a 
fixed point of T , and fi^, ^ fi, for any other fixed point fi. 

This obvious result parallels the notion of lower invariant measure in 
interacting particle systems [47]. 

Returning to the case of general S, the Banach contraction theorem spe- 
cializes to 
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Lemma 5 (The contraction method). Let V be a subset of 7^(8) such 
that T maps V into V . Let d be a complete metric on V . Suppose T is a 
(strict) contraction, that is, 

sup ^ X. 

Then T has a unique fixed point /i in V , whose domain of attraction is all 
ofV. 

A thorough account of specific metrics can be found in [57]. Most com- 
monly used is the Wasserstein metric on distributions on M with finite j»th 
moment, 1 < p < oo: 

(7) dp{^i,u) :=inf{(E[|Z-t^n)i/P|Z = /i and W = v}. 

Contraction is a powerful tool in the "linear" case of Section 3, where it 
also provides rates of convergence in the context of probabilistic analysis of 
algorithms. For max-type operations it seems less widely useful (see, e.g., the 
remark below Open Problem 62) except in simple settings (e.g.. Theorem 
32). It is also worth mentioning that in several examples we have no rigorous 
proofs of existence of fixed points (see Sections 5 and 7.4) and so the use of 
other fixed point theorems from analysis [45] might be worth exploring. 

2.3. Recursive tree processes. Consider again the setup from Section 2.1. 
Rather than considering only the induced map T, one can make a richer 
structure by interpreting 

X = g{^,X,,l<i<; N) 

as a relationship between random variables. In brief, we regard X as a 
value associated with a "parent" which is determined by the values Xi at 
N "children" and by some "random noise" associated with the parent. 
One can then extend to grandchildren, great grandchildren and so on in the 
obvious way. We write out the details carefully in the rest of this section. 

Write T for the set of all possible descendants i, where i = «i«2 • ■ ■ de- 
notes a dth-generation individual, the i^th child of its parent iii2 ■ ■ ■ id-i- 
Label the root as 0. Make T a tree by adding parent-child edges. Given 
the distribution on x Z from Section 2.1, for each i € T let (^i, Ni) have 
distribution z^, independently as i varies. Recall the function g from Section 
2.1. This structure — the random pairs {(,i,Ni), i G T, which are i.i.d. (v), 
and the function g — we call a recursive tree framework (RTF). In the set- 
ting of an RTF suppose that, jointly with the random objects above, we can 
construct S'-valued random variables X{ such that for each i 



(8) 



Xi = g{CuXij,l<j<* Ni) a.s. 
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(a) Markov Chain (b) Recursive Tree Process 



Fig. 1. On the left is a Markov chain represented as an iterated function system: X„ is 
the "outpuf of f with ^Hnputs" ^„ and Xn-i- On the right is an RTF; Xi is the '^output" 
of g with inputs and child ofi). In the figure, h is the parent of i and 

are siblings of i. 

and such that, independent of the values of {^i, A*";!! in first d — 1 generations}, 
the random variables {Xi\i in generation d} are i.i.d. with some distribution 
fid- Call this structure (an RTF jointly with the Xi) a recursive tree process 
(RTF). If the random variables X\ are defined only for vertices i of depth 
< d', then call it an RTF of depth d' . See Figure 1. 

Now an RTF has an induced map T:V{S) V{S) as in Definition 3. 
[In the extended case (6) we need to assume that T maps V into P.] 
Note that the relationship between an RTF and an RTF mirrors the re- 
lationship between a Markov transition kernel and a Markov chain. Fix 
an RTF. Given d and an arbitrary distribution fp on S, there is an RTF 
of depth d in which the generation-d vertices are defined to have distri- 
bution fid = fJ'^- Then the distributions fid, fJ'd-i, fJ'd-2, ■ ■ ■ ■, fJ'O at decreasing 
generations d,d — l,d — 2, . . . ,0 of the tree are just the successive iterates 
T{fi°),T^{ijP), T'^ifi^) of the map T. Figures 1 and 2 attempt to show 
the analogy between RTFs and Markov chains. 

One should take a moment to distinguish RTFs from other structures 
involving tree-indexed random variables. For instance, a branching Markov 
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Lemmas 14 and 15 use 
analysis of the RTP 



CFTP uses specific 
constructions X„ = /(^„,X„_i) 



RTF Xi = g(t„Xij,j>l) 



Markov chain specified 
by transition kernel K 



RDE specified by </{■) 
and dist(^) 



equation fi = fiK 
for stationary distribution 



associated T -.V ^ V 
gives fixed- point eq. T{/() : 



analytic methods 
to solve equation 



analytic methods 
to solve equation 



Fig. 2. The analogy between Markov chains and RTPs. Specifying a transition kernel is 
exactly what is needed to write the equation for a stationary distribution, and exactly what 
is needed to specify the chain. Analogously, specifying S and g{-) and dist(5) is exactly 
what is needed to write the RDE, and exactly what is needed to specify the RTP. But note 
these equivalences | occur at different conceptual levels. 



chain can also be represented as a family {Xi). But its essential property 
is that, conditional on the value Xi at a parent i, the values (Xii,Xi2, . . .) 
at the children il, 12, . . . are i.i.d. An RTP in general does not have this 
property. Conceptually, in branching processes one thinks of the "arrow of 
time" as pointing away from the root, whereas in an RTF the arrow points 
toward the root. 

Call an RTP invariant if the marginal distributions of Xi are identical at 
all depths. We have the following obvious analog of Markov chain station- 
arity. 
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Lemma 6. Consider an RTF. A distribution ^ is a fixed point of the 
induced map T if and only if there is an invariant RTF with marginal dis- 
tributions fi. 

An invariant RTF could be regarded as a particular case of a Markov 
random field, but the special "directed tree" structure of RTFs makes them 
worth distinguishing from general Markov random fields. 

A central theme of this survey paper is that for certain purposes, the 
most useful way of interpreting an RDE (1) is as the defining identity for 
an invariant RTF. 

2.4. Endogeny and bivariate uniqueness. Now imagine (8) as a system 
of equations for "unknowns" in terms of "known data" ^i. It is natural 
to ask if the solution depends only on the data. We formalize this as the 
following endogenous property. Write 

(9) ^T = cT(6,iVi, IGT). 

Definition 7. An invariant RTF is called endogenous if 

X0 is ^TT-measurable. 

A rephrasing is more intuitive. Within an RTF there is an embedded 
Galton- Watson tree T rooted at 0, whose offspring distribution N is just 
the marginal in u = dist{^,N). That is, the root has children; each 
such child i has A'^i children, and so on; T is the random set of all such 
descendants of the root 0. Write 

(10) g = a{^i,Ni, ieT). 
Then endogeny is equivalent to 

X0 is ^-measurable 

and this is the criterion we use in practice. 

It is intuitively clear (and true: Lemma 14) that when the Galton- Watson 
tree T is a.s. finite there will be a unique invariant RTF and it will be endoge- 
nous. But when T is infinite the "boundary behavior" may cause uniqueness 
and/or endogeny to fail. Theorem 11 will show that the endogenous property 
is equivalent to a certain bivariate uniqueness property. The simple artificial 
Examples 8, 10 and 13 should help to distinguish these properties from the 
"unique fixed point of T" property. 

Our first example shows that one cannot tell whether or not the endoge- 
nous property holds just by looking at T, even when the fixed point is unique. 
Write Bern(p) for the Bernoulli(p) distribution on {0,1}. 
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Example 8. Take S = {0, 1}. Define T : V{S) V{S) by r(^) = Bem(l/2) 
for all /i. So Bern(l/2) is the unique fixed point. We will give two RTFs with 
this induced T, one satisfying and the other failing the endogenous property. 

First take {C,N) with N = 2 and ^=Bern(l/2), and g{a,xi,X2) = a. 
Clearly the induced T is as above. In the associated RTF where Xi has 
Bern(l/2) marginals, observe that X0 = ^0 and so the endogenous property 
holds. Now consider the well-known von Neumann random bit extractor [30], 
which is a certain function g : {0, 1}°° {0, 1} which, applied to an indepen- 
dent Bern(p) input sequence for any <p< 1, gives a Bern(l/2) output. 
Set 



g{a,xi,X2,...) 




if Xi=X2 = Xs- 

.), if not. 



Take {(,,N) with N = 00 and ^= Bern(l/2), and then the induced T is as 
stated. In the associated RTF with Bern(l/2) marginals for Xi, the random 
variables .^i are never used, so X0 is independent of Q and the endogenous 
property fails. 



Bivariate uniqueness. In the setting of an RTF we have the induced 

map T:V ^V{S). Now consider a bivariate version. Write for the 

space of probability measures on 5*^ = 5 x 5 with marginals in V. Define 
T(2):7?(2)^p(52) 

r(2)(/i(2)) is the distribution of \ 1 < i <* iV),g(^,xf \ 1 < 

i <* N)), where: 

(i) {{xl^\x^'^''),i > 1) are independent with distribution /i^^) on p(2). 

(ii) {^,N) has distribution u; 

(iii) the families in (i) and (ii) are independent. 

The point is that we use the same realizations of N) in both components. 
Immediately from the definitions we have: 

(a) If ;U is a fixed point for T, then the associated diagonal measure fi^ 
is a fixed point for T^'^\ where 

IJL^ = dist(X, X) for /x = dist(X). 

(b) If /i^2) ig a fixed point for T^'^\ then each marginal distribution is a 
fixed point for T. 

So if /i is a fixed point for T, then /i^^ is a fixed point for T^^) and there 
may or may not be other fixed points of T^^) with marginals /i. 
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Definition 9. An invariant RTP with marginal /i has the hivariate 
uniqueness property if /i/" is the unique fixed point of T^^) with marginals 
II. 

The next example shows that even when /i is the unique fixed point of T, 
there may be fixed points of T^^) other than /i/*. 

Example 10. Take independent /, ^ such that / has Bern(l/2) distri- 
bution and ^ has Bern(g) distribution for some < g < 1. Consider the RDE 

X = X/+i+Cmod2; 5 = {0,1}. 

Here T maps Bern(p) to Bern(p') where p' = p{l — q) + {1 — p)q, so that 
Bern(l/2) is the unique fixed point of T. But product measure Bern(l/2) x 
Bern(l/2) is a fixed point for T^'^\ and this differs from (Bern(l/2))/'. 

2.5. The equivalence theorem. Here we state a version of the general 
result linking endogeny and bivariate uniqueness, without seeking minimal 
hypotheses. The result and proof are similar to standard results about Gibbs 
measures and Markov random fields (see Chapter 7 of [34] ) , but our emphasis 
is different, so it seems helpful to give a direct proof here, after a few remarks. 

Theorem 11. Suppose S is a Polish space. Consider an invariant RTP 
with marginal distribution fx. 

(a) // the endogenous property holds, then the bivariate uniqueness prop- 
erty holds. 

(b) Conversely, suppose the bivariate uniqueness property holds. If also 
T^"^^ is continuous with respect to weak convergence on the set of bivariate 
distributions with marginals ^i, then the endogenous property holds. 

(c) Further, the endogenous property holds if and only ifT^'^^^{ii ® ;u) -i /i^' , 
where fi® n is product measure. 

Here T^^^" denotes the nth iterate of T^'^\ Note that in part (c) we do 
not need to assume continuity of T^'^\ Also (c) can be used nonrigorously 
to investigate endogeny via numerical or Monte Carlo methods, as will be 
described in Section 8.1. For the record we state: 

Open Problem 12. Can the continuity hypothesis in (b) be removed? 

Example 13 (Noisy voter model on directed tree). This example shows 
that the endogenous property may hold for some invariant measures while 
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failing for others. Take S = {0, 1} and let ^ have Bern(e) distribution for 
small e > 0. Consider the RDE 

^+X2+X3>2) ™od 2. 

In words, a parent vertex adopts the majority opinion of its three children 
nodes, except with probability e adopting the opposite opinion. The Bern(p) 
distribution is invariant iff p satisfies 

p={l- e)q{p) + e{l - q{p)y, q{p) = p^ + 3p\l-p). 

There are three solutions ^,1 — p*(e)} where p*(e) J, as e J, 0. As 

in Example 10, the invariant RTF with Bern(l/2) marginal is not endoge- 
nous because the product measure is invariant for T^'^\ But the invariant 
RTF with Bern(p*(e)) marginal is endogenous; one can check that T^^) IS 
a strict contraction on the space of bivariate distributions with Bern(p*(e)) 
marginals, and then appeal to the contraction lemma and to Theorem 11(c). 

Remarks. Theorem 21 and Corollary 26 provide other contexts where 
the endogenous property holds for the "fundamental" invariant measure but 
not for others. Contexts where the fundamental invariant measure is nonen- 
dogenous are less common: Froposition 48 is the most natural example. 

Froof of Theorem 11. (a) Let be a fixed point of T^^^ with marginals 
Ai. Consider a bivariate RTF ((xj^^xf^), i G T) with u = dist(xi^\ X^^^). 
Define Gn '■= -^i) '■ gen(i) < n) where for i = iii2 ■ ■ - id we set gen(i) = d, 
its generation. Observe that Qn] 0- 

Fix A : S" ^ M a bounded continuous function. Notice that from the con- 
struction of the bivariate RTF, 

(Xi^^ (6, iVi), gen(i) <n)l (A^'); (6, M), gen(i) < n). 

So 

(11) E[A(Ai^))|a„]=E[A(A4'))|g„] a.s. 
Now by martingale convergence 

(12) E[A(Ai^))|g„] ^E[A(A4^))|e?]"^-A(A4^)), 

the last equality because of the endogenous assumption for the univariate 
RTF. Similarly, 

E[A(x4'))|g]'^^-A(Ai')). 

Thus by (11) we see that A(X0 ^) = A(A0^'*) a.s. Since this is true for ev- 
ery bounded continuous A we deduce X^^ = X^^ a.s., proving bivariate 
uniqueness. 
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(b) To prove the converse, again fix A : S* ^ M bounded continuous. Let 
(Xi) be the invariant RTF with marginal fi. Again by martingale convergence 

(13) K[A{X^)\gn] ^E[A{X^)\g]. 

Independently of {Xi,(^i,Ni, i G T), construct random variables (Vi,i S T) 
which are i.i.d. with distribution fi. For n > 1, define := if gen(i) = n, 
and then recursively define for gen(i) < n by (8) to get an invariant RTF 

(F;") of depth n. Observe that X0 = Y^. Further given Q')^ , the variables X0 
and Y^ are conditionally independent and identically distributed given Qn- 
Now let 



(14) 
We calculate 

(15) 



|E[A(X0)|g„]-A(Xg 



E[iA{X0)-E[A{X0)\gn]y 
E[var(A(X0)|g„)] 

MiMX^) - A{YS)f]. 



The last equality uses the conditional form of the fact that for any random 
variable U one has var([/) = ^E[{Ui - t/a)^], where Ui,U2 are i.i.d. copies 
of U. 

Now suppose we show that 



(16) (X0,y^)4(x^y^) 

for some limit. From the construction, 



Xz 

vn+l 

£71 



say, as n — i- 00 



X^ 



and then the weak continuity assumption on T*-^-* implies 



X* 
Y* 



X* 
Y* 



= Y^ = i^t for all n>l, and hence X* = Y* = fi. 



Also by construction we have 
The bivariate uniqueness assumption now implies X* = Y* a.s. Since A is a 
bounded continuous function, (16) implies A{X0) — A(Y^) — > a.s. and so 
using (15) we see that c^(A) — > 0. Hence from (14) and (13) we conclude 
that A(X0) is ^/-measurable. This is true for every bounded continuous A, 
proving that X0 is ^-measurable, as required. 

Now all that remains is to show that a limit (16) exists. Fix f : S ^ 
R and h: S two bounded continuous functions. Again by martingale 

convergence 

E[f{X0)\gn]^E[f{X0)\g], 
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and similarly for h. So 

E[fiX^)h{YS)]=m[f{X<,)h{YS)\gn]] 

= E[E[f{x^)\gnnhiYS)\gn]], 

the last equality because of conditional independence of X0 and given 
Qn- Letting n ^ 00 and using the conditionally i.d. property gives 

(17) E[fixMys)]^nnfiX0)\Gnh{X0)m- 

Moreover note that X0 = Y^ = fi and so the sequence of bivariate distribu- 
tions (X0,Y0) is tight. Tightness, together with convergence (17) for all 
bounded continuous / and h, implies weak convergence of {X0,Y0). 

(c) First assume that T(^)"(/i®/u) . Then with the same construction 
as in part (b) we get that 

{X0,Y0 ) {X0,X0). 

Further recall that A is bounded continuous; thus using (13), (14) and (15) 
we conclude that A{X0) is ^-measurable. This is true for any bounded 
continuous function A; thus X0 is ^-measurable. So the RTF is endogenous. 

Conversely, suppose that the RTF with marginal fi is endogenous. Let 
Ai and A2 be two bounded continuous functions. Note that the variables 
(^0,1^0), as defined in part (b), have joint distribution T^^^ (^u Cg) /i). Fur- 
ther, given Qn-! they ctrc conditionally independent ctnd lia,ve the so^nie con- 
ditional law as of X0 given Qn- So 

E[Ai{X0)A2{YS)] = E[E[A,{X0)\g„]E[MX0)\gn]] 
-^E[E[A,{X0)\g]E[A2{X0)\g]] 
= E[A,{X0)A2{X0)]. 

The convergence is by martingale convergence, and the last equality is by 
endogeny. So 

r(2)"(^ {X0,YS) ^ {X0,X0)^f^^. □ 

2.6. Tree- structured coupling from the past. The next lemma is clearly 
analogous to the coupling from the past (CFTF) technique for studying 
Markov chains [56]. That technique is part of a large circle of ideas {graphi- 
cal representations in interacting particle systems [47]; iterated random func- 
tions [27]) for studying uniqueness of stationary distributions, and rates of 
convergence to stationarity, for Markov chains via sample path construc- 
tions. 
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Lemma 14. Consider an RTF and write T for the associated Galton- 
Watson tree. Suppose there exists an a.s. finite subtree Tq CT containing 
such that the following property holds a.s. for each i: 

// i G To, then in the relation Xi = g{^i,X{j, 1 < j <* A'^i) the value 
of Xi is unchanged by changing the values of X\j for which ij ^Tq. 

Then there is a unique invariant RTF and it is endogenous. 

In particular, ifT is a.s. finite (equivalently, ifE[N] < 1 and P{N = 1) < 
1 ), then there is a unique invariant RTF and it is endogenous. 

Proof. Write ht(To) for the height of 7^. Fix d. Define {X^'^^ , gen(i) = d) 

arbitrarily, and then use (8) recursively to define {X^'^\gen{i) < d). The 

hypothesis implies that on the event {ht(7o) < d} the value of X^^ does 

not depend on the arbitrary choice of {X^'^\gen{i) = d), and equals some 
^-measurable random element. Letting d^ oo shows there exists some Q- 
measurable X0 such that 

PiX^^^ + X^) < P(ht(To) >d)^0. 

The same argument applied to a first-generation individual j shows there 
exists ^-measurable Xj such that 

^ / Xj) < P(ht(To) >d-l)^0. 

Use the i.i.d. structure of {{^i,Ni),i G T) to show that iXj,j > 1) are inde- 
pendent and distributed as X0. Then by the defining recursion 

X0=g{C,Xi,l<i<* N) 

and so dist(X0) is invariant. Moreover, in any invariant RTP it must be 
that X0 is this same r.v., proving uniqueness. □ 

Example 8 shows that (stochastic) monotonicity of the induced map T is 
not sufficient for endogeny. The next lemma shows that a stronger "pointwise 
monotonicity" condition on g is sufficient. 

Lemma 15. Suppose S = M'^. Suppose g:Q* is monotone for 

each 9. That is, if 1 <n < 00 and x = {xi,l < i <* n) and y = {yi,l < i <* n) 
are such that Xi < yi Mi, then g{9,x.) < g{9,y). Suppose that for fixed 9 the 
map X— >5((0,x) is continuous w.r.t. increasing limits. Suppose that, for 
the induced map T, the sequence {T"'{do),n > 0) is tight. Then r"(5o) — > n 
weakly, where the limit ji is invariant and the associated invariant RTF is 
endogenous. 
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Proof. This fi is the lower invariant measure of Lemma 4. Let (Xi) 
be the associated RTP. For each d there is a depth-d RTP (xj'^^) such that 
dist(Xj^'^^) = r'^~s™(')(5o)- Using the monotonicity hypothesis 

< X^^^ <X^^^ <---<X0 a.s. 

Since dist(X0'^^) — > /i we have X^^ | X0 a.s., and then since X^^ is (/-measurable 
we see that X^ is ^-measurable. □ 

2.7. Markov chains. Any Markov chain can be represented (distribu- 
tionally) as an iterated random function Xn = g{Xn-i,Cn) for i.i.d. and 
some g. So the stationary distributions (if any) are the solutions of 

X = giX,C). 

This is the special case of RDEs for which P{N = 1) = 1. In general when 
we talk about RDEs we are envisaging the case where P{N > 2) > 0. 

3. The linear case. The basic linear case is the case g{{£,, Xi)) = J2iLi S,iXi 
on S" = M. Note the (^j) may be dependent. This and the extension (20) have 
been studied quite extensively; our discussion focuses on analogies with the 
max- type cases later. Where the state space is M"*", the key ideas are from 
[28] which assumed N is nonrandom; the extensions to random N (which 
is a frequent setting for our max-type examples) have been developed in 
[49, 50]. Here is a typical result (Corollaries 1.5 and 1.6 of [49]; the case of 
nonrandom is in [28]; minor nontriviality assumptions omitted). 

Theorem 16. Suppose > 0, with > iff I <i < N, for some ran- 
dom < N < 00. Suppose N and J2iCi have finite (1 + 5)th moments, for 
some 5 >0. Write p{x) = ^[J2iCi]- Suppose there exists < a < 1 such that 
p{a) = 1 and p'{a) < 0. Suppose either: 

(i) a = l; or 

(ii) the measure X]i-P(log^i € •) is not centered-lattice, that is to say, not 
supported on sZ for any real s > 0. 

Then the RDE 
(18) X^Y^iiX, (5 = M+) 

i 

has an invariant distribution X with P{X = 0) < 1, and this solution is 
unique up to multiplicative constants. In case (i), E(A) < 00 if p'{a) < 0. In 
case (ii), if a <1, then P{X > x) ~ cx~" as x ^ 00, for some < c < 00. 
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One can study ([49], Theorem 6.1) the operator T corresponding to (18) 
with respect to the metric defined as at (7) but without the (•)^/" term. 
In the setting of Theorem 16 it turns out that the contraction coefficient is 
p{a) = 1 and hence the contraction argument cannot be used directly. The 
proof of Theorem 16 instead involves somewhat intricate analysis to find 
the moment generating function of X. See [38] for the case where N may be 
infinite. 

See [49] for many references to the appearance of the linear RDE (18) 
in the study of branching processes and branching random walks, invariant 
measures of infinite particle systems, and Hausdorff dimension of random 
Cantor-type sets. See [26, 60] for many references to linear RDEs arising in 
probabilistic analysis of algorithms which are analyzable by contraction. See 
[39, 40] for the specialization 



d 

i=l 



where {^i) are the points of a Poisson process on (0, oo). Often, within one 
model there are different questions which lead to both linear and max-type 
RDEs; instances can be found in Sections 4.1, 5 and 7.4. 

Questions of endogeny have apparently not been studied in this linear 
case. Note that Example 2 provides a (degenerate?) case of a linear RDE on 
M which is not endogenous. The following corollary deals with the simplest 
specialization of the Theorem 16 setting. 

Corollary 17. In the setting of Theorem 16, suppose (i) holds and 
p'{l) < 0, so that the RDE (18) has a solution X with EX < oo and P{X = 
0) < 1. Then the associated RTF is endogenous. 

Proof. Consider a solution of the bivariate fixed point equation 



{X,Y)^(^Y.^,X,,J2^,Y^- 



Observe 



\X -Y\ = 



and the expectations of the leftmost and rightmost terms are equal. So the 
inequality must be the a.s. equality 

(19) =J2^^\^i-Yi\ a.s. 
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By Theorem 11(b) it is enough to show X = Y a.s. Suppose not. Then Xi — Yi 
takes both positive and negative values. So we cannot have P{S,i > 0, ^2 > 
0) > or there would be nonzero chance of cancellation in the sum and 

(19) would fail. Thus the RDE can only be of the form X = ^iXi. But this 
can only happen if -P(Ci = !) = !) which case is excluded by the hypothesis 

/9'(1)<0. □ 

Open Problem 18. Study endogeny in the other cases of Theorem 16. 

It is worth pointing out that there is no very complete "general theory" 
for S* = M: 

Open Problem 19. Study analogs of Theorem 16 for 5" = M. 

Of course the contraction method remains useful in particular cases. See 
[22] for results on smoothness of solutions in the case of finite second mo- 
ment. 

3.1. The Quicksort RDE. A slight extension of the linear case is the case 

(20) g{{ii,Xi))^iQ + Y^i,Xi. 

i>l 

As a well-known concrete example, probabilistic analysis of the asymptotic 
distribution of the number of comparisons in the Quicksort algorithm leads 
to the study of the following RDE: 

(21) X = UXi + {l-U)X2 + C{U) {S = R) 

where C{x) := 2xlogx + 2(1 — x) log(l — x) + 1, and U = U{0, 1). There is 
a unique solution with E[X^] < 00 because T is a contraction under the 
metric ^2 at (7) [59]. But there are also other solutions. 

Theorem 20 ([31]). Let v he the solution of the RDE (21) with zero 
mean and finite variance. Then the set of all solutions is the set of distribu- 
tions of the form v * Cauchy(m, cr^) where m G M and > 0, and * denotes 
convolution. 

The next result basically says that none other than the "fundamental" 
solution of the Quicksort RDE (21) is endogenous. 

Theorem 21. An invariant RTF associated with the Quicksort RDE 
(21) is endogenous if and only if a = 0. 
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Proof. Let /i be a solution of the RDE (21), so using Theorem 20 
^ = V * Cauchy(m, cr^) for some m € M and o"^ > 0. Suppose {X,Y) is a 
solution of the bivariate RDE with marginals ^ 

fX\^(UX, + {l-U)X2 + C{U)\ 
^ ' \y ) \UYi + {l-U)Y2 + C{U) )' 

where [Xi^Yi) and {X2,Y2) are i.i.d. having the same distribution as {X,Y) 

and are independent of U = Uniform[0, 1]. 

First consider the case cr = 0. In this case both X and Y have finite 
second moment and hence so does D = X — Y . Naturally the distribution of 
D satisfies the RDE 

D = UDi + {l-U)D2 (onM), 

where Di = Xi — Yi^ i G {1,2}. Easy calculation shows that E [D] = = 
E[L)^]. Thus X = Y a.s. which proves endogeny for the invariant RTF with 
marginal //, by using part (b) of Theorem 11. 

Now consider the other case a > 0. Let Q be a random variable with 
distribution v and let {Z,W) be i.i.d. Cauchy(m, cj^), independent of Q. We 
claim that {X, Y) = (Q + Z,Q + W) is a solution of the bivariate equation 
(22). In that case X ^Y a.s. and hence part (a) of Theorem 11 implies that 
the invariant RTF with marginal fj, is not endogenous. □ 

So all that remains is to prove the claim, which will use the following 
lemma. 

Lemma 22. Let {Z\^Z2) he i.i.d. Cauchy(m,o"^) for some ?n G M and 

> and let C/= Uniform [0, 1] be independent of {Z\^Z2). Then V = 
U Zi + {1 — U)Z2 is a Cauchy(m, (T^) random variable which is independent 
ofU. 

Froof. We will calculate the characteristic function of V conditioned 
on U. Fix t G R; then 

= E[e^^*^i|C/] X E[e'^^-^^^^^\U] 

= exp{imtU - aU\t\) x exp{imt{l - U) - a{l - U)\t\) 
= exp(imt — \t\) 

as required. 

Now let {Qi,Q2) be two independent copies of Q and let (Zi, Z2, VFi, VF2) 
be i.i.d. Cauchy(m, cr^) which are independent of {Qi,Q2)- Define Xi = Qi + 
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Zi and Yi = Qi^ Wi for i € {1, 2}. Then {Xi.Yi) and (Xa, I2) are two i.i.d. 
copies of (X,y). Trivially iJXi + (1 - U)X2 + C(C/) = Q' + Z' and UYi + 
(1 - U)Y2 + C{U) = Q' + W, where Q' = UQi + (1 - U)Q2 + C{U), Z' = 

UZi + (1 - U)Z2 and VF' = UWi + (1 - ^7)1^2. Notice that Q' = Q and that 
by Lemma 22, Z' and W^' are i.i.d. Cauchy(m, cj^) which are independent of 

Q'. Hence (Q' + Z' , Q' + W) = {X, Y). This proves the claim. □ 

3.2. Moment recursions. Another feature of the linear and the extended 
linear cases is that one can give a recursion for the moments of the solutions 
X , assuming moments exist. For instance, in case (20) 

E[x^]=E[e,] + (^2j2ncoc.]^nx] 

\i,j>l,i^j / \j>l / 

Unfortunately one does not have analogous general explicit information in 
our max-type setting. 

4. Simple examples of max-type RDEs. The examples in this section 
are "simple" in a particular sense: one can construct an explicit solution 
(typically in terms of the stochastic process from which the RDE arises) 
without needing first to solve the fixed point equation analytically. 

4.1. Height of subcritical Galton-Watson trees. A Galton-Watson tree 
is the family tree of a Galton-Watson branching process with offspring 
distribution N , say, and with one progenitor. Exclude as trivial the cases 
P[N = 0) = 1 and P{N = 1) = 1. In the (sub)critical case E[iV] < 1, it is 
well known by probabilistic arguments that the branching process becomes 
extinct a.s., so that the random variable 

H := min{5f| no individuals in generation (7} = 1 + (height of the tree) 

is a.s. finite. By conditioning on the number of offspring of the progenitor, 
we see that H satisfies the RDE 

(23) H = l + ma.x{Hi,H2,...,HN), € {1, 2, 3, . . .}, 

where the max over an empty set equals zero. Lemma 14 shows this RDE 
has a unique solution and is endogenous (of course this is also easy to check 
directly) . 
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This RDE (23) is a natural prototype for max-type RDEs, and the fol- 
lowing section describes one direction of generalization. 

Note that the total progeny Z in the Galton-Watson tree satisfies the 
linear RDE in Example 1. This is one of several settings where aspects of 
the "typical" behavior are governed by a linear RDE while aspects of the 
"extreme" behavior are governed by a max-type RDE. 



4.2. Positive range of one- dimensional BRW. Consider a discrete-generation 
process in which individuals are at positions on the real line M. In generation 
there is one individual, at position 0. In generation 1 we see that individ- 
ual's offspring; there are offspring (for random < < oo) at positions 
oo > ^1 > ^2 > ■ • ■ 1 the joint distribution of (A^; ^j, i > 1) being arbitrary sub- 
ject to the moment condition (24) below. Inductively, each individual in 
generation n, at position x say, has A^' children at positions (x-l-i^Q, where 
the families {N';(^^) are i.i.d. for different parents. This process is (discrete- 
time, one-dimensional) branching random walk (BRW). The phrase "random 
walk" indicates the spatial homogeneity (otherwise we would have a branch- 
ing Markov chain). Some authors use BRW for the more special case where 
different siblings' displacements are independent of each other and of A^; we 
shall call this the IBRW (independent BRW) case. Write 



m{9)=E 



The moment condition we shall assume throughout is 
(24) 39 >0 such that m{e) < oo. 

The underlying Galton-Watson process, obtained by ignoring spatial posi- 
tions, may be subcritical, supercritical or critical (depending on E[A^], as 
usual). Consider 

Rn ■= position of rightmost individual in generation n 

with Rn = — oo if there is no such individual. Write nonextinction for the 
event that the process survives forever. Standard results going back to [19] 
show: 



Proposition 23. // the BRW is supercritical, then there exists a con- 
stant — oo < 7 < oo such that n~^Rn —>■ 7 a.s. on nonextinction. And 7 is 
computable as the solution of 

inf(logm(0)-76') = O. 
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Now consider R := max„>o Rn, the position of the rightmost particle ever. 
If the process becomes extinct a.s., or in the setting of Proposition 23 with 
7 < 0, we clearly have < R < co a.s. Studying R generalizes the study (Sec- 
tion 4.1) of the height of a Galton-Watson tree (take = 1), as well as the 
study of the rightmost position of a random walk [take = 1 and compare 
with (5)]. Applications to queueing networks are given in [44]. Conditioning 
on the first-generation offspring leads to the RDE below, and Lemma 15 
establishes the other assertions. 

Lemma 24. Suppose extinction is certain, or suppose supercritical with 
7<0. Then 



If extinction is certain, then R is the unique solution of this RDE and the 
RTF is endogenous. In the supercritical case, R is the lower invariant mea- 
sure for the RTF, and the associated invariant RTF is endogenous. 

There is an interesting critical scaling question here — see Open Problem 
30 later. The lemma also leaves open the question of whether there may be 
other invariant measures in the supercritical case. A thorough treatment of 
the latter question was given in [21] within a slightly more general setting, 
including the next result showing that nonuniqueness is typical. 

Proposition 25 ([21], Theorem 1). Under technical hypotheses (omit- 
ted here) the RDE (25) has a one-parameter family of solutions -^^(7), 0< 
7 < 00. Here X(0) is the lower invariant measure. There exists a > such 
that for each ^ >0 we have P{X{'y) > x) c^ exp(— ax) as x ^ 00, for some 
< c-y < 00. 

Without needing to go into the proof of Proposition 25, we can observe 
the following. 

Corollary 26. Under the hypotheses of Proposition 25, for 7 > the 
invariant RTF associated with X{'y) is not endogenous. 

Proof. Let {Qi, i (zT) be the associated BRW; that is, T is the family 
tree of descendants of the progenitor, and Qi is the position on M of indi- 
vidual i, with Q0 = 0. Fix d and consider the following construction. Let 
{Z^'^^ : gen(i) = d) be i.i.d. with some invariant distribution. For i G T define 



(25) 





(d) 



gen(i) = d, 
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and then for gen(i) = d — l,d — 2, . . . ,1,0 define 

yS''^ = max(0; - Qu gen(j) < d; Qj - Qi + zf, gen(j) = d) 

where j runs over all descendants of i. One can check that (Y^'^^) defines an 
invariant RTF of depth d. Now let be the event that, in the definition 

Y^''^ = max(0; Qj, gen(j) <d; + zf, gen( j) = d) 

the maximum is attained by some generation-d descendant. We use the 
following straightforward lemma whose proof is given later. 

Lemma 27. For an r.v. Y and (5 > define 

conc(dist(y),(5) = max P(a <Y <a + 5). 

Suppose (Zi) are i.i.d. with P{Z > x) ~ ce~"^ as x — > oo. Then there ex- 
ists 5 > 0, depending only on the distribution of Z , such that for every 
countable set (xi) of reals for which Y := maxj(xi + Zi) < oo a.s., we have 
conc(dist(y),(5)<l-5. 

On the event the r.v. Y^'^ is of the form in the lemma, with the role 
of the (xj) played by the ^^-measurable r.v.'s (Qj :gen(j) = d), where Qd is 
the <T-field generated by the first d generations of the BRW. So the lemma, 
together with the tail estimate in Proposition 25, implies 

conc(dist(yi'^^|e?d), 5)<l-5 on Ad- 

This estimate remains true for an invariant RTF (Yj) of infinite depth. If 
the RTF is endogenous, then Y^ is ^-measurable, and so the conditional 
distributions of Y^ given Qd converge as d — > oo to the unit mass at Ygj ; then 
the inequality above implies P{Ad) — > 0. But P{Ad) — > implies 

Y^ = max((5i : i G T) 

and so the invariant distribution is just the lower invariant distribution. □ 

Proof of Lemma 27. Suppose if possible the conclusion of the lemma 
is not true. Then for every 6n [ 0+ we can find a countable collection of reals 
{x'^)i>i such that Yn := maxj>i(x" + Zj) < oo a.s. and 

(26) P(0<y„<<5„)>l-5„. 

By assumption P[Z > x) ~ ce^"^ as x ^ oo, so 1^ < oo a.s. implies 

oo 

(27) 0<^e"^'^<oo. 

i=l 
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So in particular — — oo as i — > oo for every n > 1. Thus without loss of 
generality we can assume that {xf, i>l) are in decreasing order. 

Let F be the distribution function of Z, and write F(-) = 1 — F{-). We 
calculate 

P(0 < Yn < Sn) 

= l-P{Yn^[0,6n]) 

= 1 - P{Zi < -x" for ah i > 1, or Z,j > (5„ - for some i > 1) 

oo 

< 1 - n Pi-^ - O - ^axF{6n - X'i), 

1=1 *~ 

for arbitrary fixed A > 0. So from (26) we get 

oo 

(28) \\F{-\-x'}) + uis.i^F{5n-x'^)<5n. 

1=1 ~ 

But maxj>i F((5„ — ) = F{5n — a;"), so using (28) we get 

(29) lim F{5n - ) = =^ x? -oo as n ^ oo. 

n — ^oo 

Now fix e > 0. By hypothesis, there exists M > such that 

(30) (1 - e)ce-"^ < F{x) < (1 + e)ce-°^ for all x>M- A. 

Choose no > 1 such that for all n > no we have x" < —M, and hence xf < 
-M for alH > 1. Now from (28) 

(31) 5„ > F(<5„ - X?) > (1 - e)ce-"('^--^?) =^ e"^" < — ^5„e°'^" . 

c(l -e) 

Further, for any k^, 

Y[F{-X-xf) = Y[{l-F{-X-xf)) 

1=1 i=l 

> (l-F(-A-x^))*^" 

(32) 

> {l-{l + e)ce'^^e'^''")''" 

where the last inequaUty follows from (31). Now take = t c>c to get 

lim infrr^(-A-xn > 1. 

i=l 

This contradicts (28). □ 
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4.3. An algorithmic aspect of BRW. In the setting of Section 4.2 — a 
BRW satisfying (24) — there is an algorithmic question. Suppose we are in 
the supercritical case [say, P(N > 1) = 1 to avoid any chance of extinction] 
and suppose 7 > 0. So there exist individuals at large positive positions — 
how do we find them? Suppose we can access data only by making queries. 
A query 

children of progenitor? 
gets an answer 

progenitor has child G at position -0.4 and child V at position 
-0.8 

and a query 

children of G ? 

gets an answer 

G has child GF at position 0.6 and child GJ at posi- tion -1.6. 

There is a natural greedy algorithm for finding individuals with large positive 
positions. At each step, look at all individuals X named in previous steps 
for which one has not already made the query children of X?; then make 
this query for the individual X at rightmost position. See Figure 3. 

This greedy algorithm was studied in [2], in the special setting of binary 
IBRW which we now adopt (presumably much of what we say here holds 
in the general BRW setting). In analyzing the performance of the greedy 
algorithm, a key role is played by the position L of the leftmost individual 
ever queried. On the one hand this is given by 

(33) L = supmfQw. 

where the sup is over all lines of descent (wi) and where Qi is the position 
of individual i. On the other hand, by conditioning on the positions of 




><• — x_ 

Fig. 3. Algorithmic exploration of BRW. The individuals • have been queried m order 
1,2,3,4; the children X have not yet been queried. Individual 5 will be queried next. 
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the first-generation children we see that L must satisfy the RDE 

(34) L= min(^0,max(Li + -cx)< L < 0. 

This is genuinely different from (25), that is, one cannot obtain (25) from 
(34) by, for example, reflection. As shown in [2], Proposition 4.1, this RDE 
has a unique solution L, and from (33) we see the associated invariant RTP 
is endogenous. 

The actual question of interest in this setting in the speed of the greedy 
algorithm, defined as the limit 

speed := \\m.—Qi, 

n n 

where Vn is the nth vertex examined by the greedy algorithm. It turns out 
there is a simple formula for speed. 

Proposition 28 ([2]). For Unary IBRW, 

speed = E[(^ + L)+] 

where L is the unique solution of the RDE (34) and ^ independent of L is 
the displacement of a child. 

So Proposition 28 is a prototype for one kind of indirect use of an RDE. 
From this formula, but not a priori, one can deduce that the speed is strictly 
positive whenever 7 > 0. This leads to the question of near- critical scaling. 
Given a one-parameter family of distributions for offspring displacement (^j), 
parametrized by p say, and such that ■~f{p) > iS p > Pcriti we will typically 
have linear scaling for 7: 

7 (p) ~ c(p - Pcrit ) as p i Pcrit • 

But how does speed(p) scale? A special case permits explicit analysis. 

Theorem 29 ([3], Theorem 6). Consider an IBRW where each indi- 
vidual has exactly two children whose displacements satisfy P(^ = 1) = 
p, = —I) = 1 — p. The critical point pcrit 'is the smaller solution of 
16pcrit(l — Pcrit) = 1- The Solution L{p) of the RDE (34) satisfies 

- log P{L{p) = 0) = C{p - Pcrit)"^/^ + 0(1) as Pi pcrit 

for a certain explicitly defined constant c, and 

speed(p) =exp(-(c + o(l))(p-pcrit)~"^^^) as piPcrit- 

Open Problem 30. In the context of more general one-parameter fam- 
ilies of offspring displacements (^j): 
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(a) In the supercritical setting pjpcrit) study whether the scahng for 
speed (p) is as in Theorem 29. 

(b) In the subcritical setting p | Pcriti study the scahng of the range R{p) 
given in Lemma 24. 

4.4. Discounted tree sums. In this section we study the RDE 

(35) X = 7]+ max Ci^i (5 = M+) 

l<i<oo 

where (77; ^j, 1 <i <* N) has a given joint distribution, for random N < 00. 
There is a natural construction of a potential solution via what we will call 
discounted tree sums, as follows. Take the associated Galton-Watson tree 
T with offspring distribution A^. Put i.i.d. copies r/i of i] at vertices i. On 
the edges from each i to its children {ij)j>i put independent copies (^y, 1 < 
j <* M) of (^j, 1 < J <* A^). For an edge e = (i, ij) we will write to denote 
the edge weight ^y. Consider a path (0 = vo,vi, . . . ,Vci). View the random 

variable ry^,^ as having "influence" i]^aYl'j=i^[vj-i,vj) the root; that is, the 
influence is decreased by a factor ^ in crossing an edge. From an infinite 
path vr = (0 = vo,vi,V2, . . .) we get a total influence J2T=oVvd Il'j=iC{vj.i,vj), 
which we suppose to be a.s. finite. Finally set 

00 d 

(36) X= sup Y.^^ciY[^{vj-i,vj)- 

7r=(0=vo,fl,f2v) d=0 j = l 

If X < 00 a.s., then clearly it is a solution of the RDE (35), and this solution 
is endogenous. 

But it is not so easy to tell, directly from the representation (36), whether 
X is indeed finite. So for the record we state 

Open Problem 31. Study conditions under which (36) defines an a.s. 
finite random variable X. 

This question makes sense when we allow P{^i > 1) > 0, though the con- 
crete examples we know involve only the case < 1 a.s. We content our- 
selves with recording a simple contraction argument (essentially that of [58] , 
(9.1.18), in the setting of finite nonrandom N) designed to handle the case 

< 1 a.s. 

Theorem 32. Suppose < £,i < I and r/ > with E[7f] < 00 \/p>l. 
For 1 < p < 00 write c{jp) := X^i^o^I^f] — Suppose c{p) < 00 for some 
1 < p < 00. 

(a) The distribution ^ of X at (36) is an endogenous solution of the 
RDE (35) with all moments finite. For the associated operator T we have 

T"(5o)^/i. 
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(b) Take p < oo such that c{p) < 1. Then T is a strict contraction on the 
usual space J-p of distributions with finite pth moment. So ^ is the unique 

solution of the RDE with finite pth moment, and T"(/io)— for any /xq € 
J'p . 

Proof. By assumption c(po) < c>o for some pq, and then since < 1 we 
clearly have c{p) | as p | oo. So choose and fix 1 < p < oo such that c(p) < 1. 
Write J-p for the space of distributions on with finite pth moment. We 
will first check that T[Tp) C J^p. Let jj,^ Tp and let {X)i>i be i.i.d. samples 
from ^ which are independent of (^i)i>i and Define as the pth moment 
of fi. Observe that 



E 



( max(^iXi) 



< E 



max(efXf) 



li=l 



= [^i]p X c{p) < oo. 

Further we have assumed that E[?7^] < oo; thus using (35) we conclude that 
T maps Tp to itself. 

Let dp be the usual metric (7) on jFp. Fix /U, € J-p. By a standard coupling 
argument construct i.i.d. samples {{Xi,Yi))i>i such that: 

• they are independent of {Ci)i>i and rj; 

• Xi = fi and Yi = u for all i > 1 ; 

. idpifi,u))p=n\x,-Y,\p]. 

Put Z = 7] + maxj>i(^jXj) and W = rj + maxj>i(,^iyj). Notice that from 
definition Z = T{^i) and W = T{v). Now 



{dp{T{^,),T{v))f<nz-w\ 



E 



< E 



max^jXj 

i>l 



max^jFj 



Y^\iiX,-iiYi\P 



c{p) X dP{n,ij). 
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So T is a strict contraction map with contraction factor {c{p))^^'^ < 1. Since 
dp is a complete metric on Tp, the contraction method (Lemma 5) shows 

that there exists a fixed point Q J^p and that T"(/io) — > fJ- for each uq £ J-p. 

In particular, T"(5o) —^fJ'- But T"((5o) is just the distribution of 

n— 1 d 

(37) X("):= sup E^-.n^(«.-i,-.)- 

7r=(0=Uo,Ul,l'2v>fn-l) £^=0 j = l 

So /X is the distribution of X at (36). Finally, we can choose p arbitrarily 
large, so /i has all moments finite. □ 

While the argument in Theorem 32, bounding a max by a sum, is crude, it 
does serve to establish existence of solutions in the examples we will consider 
below. Let us first say something about uniqueness. 

Corollary 33. Consider the RDE (35). Suppose X at (36) is well de- 
fined (which in particular holds under the hypotheses of Theorem 32 j. Write 
fi = dist(X) for the lower invariant measure for the associated operator T . 
Consider the RDE obtained by omitting t] in (35); 

(38) X= max ^iXi {S = R+). 

l<i<oo 

// dist(y) is a nonzero solution of (38), then for < a < oo we have 

r"(dist(ay))4/,, 

and each fia is an invariant measure for T. If also 

(39) T] is independent of (^j); is in the support of r], 
then each fia is distinct. 

By analogy with Corollary 26 and Proposition 48 later we state: 

Conjecture 34. Under the assumptions of Corollary 33 and (39), for 
a > the invariant RTF associated with is not endogenous. 

Proof of Corollary 33. Write W for the operator associated with 
(38). Suppose V ^5q\s invariant for W . Set up the RTP {Y{) associated with 
(38). So for fixed n 

n 

Y0= sup Y^„Y[C{vj.uVj) 

n={0=Vo,Vi,V2,...,Vn) j — l 
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and the (Y^^ :gen(u„) =n) are independent with distribution u. And T'^{y) 
is the distribution of 



n-l d N 

„ TT ) 



(40) z(")= sup [yv„X{k-,-„v,) + Y.^v,X{ii,^_„. 

■K={0=Vo,Vl,V2,-,Vn)\ j = l d=0 j = l / 

Note the sample path monotonicity property 

< ^(n+i) a.s. 

which holds for the following reason. Given Vn, there is a Vn+i = Vni at- 
taining the maximum Y^^^ = maxi^y^iXy^i, and the right-hand side of (40) 
for {vo,vi, . . . ,Vn+i) is not smaller than the right-hand side of (40) for 

{vo,Vi,...,Vn)- 

This monotonicity, together with the facts 
implies existence of the limit 

] Z<oo a.s. 

So 

r"(i/) = dist(Z(")) ^ dist(Z) := ^ii say 

and by continuity, jii is invariant for T. 

Now for arbitrary a > define zi"'^ by replacing by aYy^ in (40). As 

above there exists a limit zi""^ | < oo a.s., and /ia := dist(Za) is invariant 
for T. 

To prove the final assertion of Corollary 33, fix < a < 6. Clearly Za < Zi, 
a.s., so it is enough to prove P{Zi, > Za) > 0. By Lemma 35 below (whose 
easy proof we omit) it is enough to prove 

(41) P{aY0 >X)>0. 

Let H be the a-field generated by the RTP (Yi) and by all the ^v,v'- By 
assumption (39) the (?7i) are independent of 7i, and it easily follows that 

P{X<e\n)>0 a.s., for each e>0. 

Since Y0 is Ti-measurable, this establishes (41). □ 

Lemma 35. For i = 1, 2 let fi >0 be a function such that f* := sup/j < 
00. For a > let q{a) := sup(a/i -|- /2). // afl > /|, then q{h) > q{a) for all 
b> a. 
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Corollary 33 hints that general solutions of the "discounted tree sum" 
RDE (35) correspond to solutions of the homogeneous RDE (38). Unfortu- 
nately the latter is not trivial to solve. For by taking logs (set X = logX, = 
log^) we see (38) is equivalent to 

X= max(|j + Xi) 

i 

and this RDE, to be studied in Section 5, is the fundamental example of 
a max-type RDE which cannot be solved by any simple probabilistic con- 
struction. See [41] for further discussion of (38). 

4.5. Examples of discounted tree sums. 

Example 36. Take U= Uniform(0, 1) and consider the RDE 

(42) X = l + max{UXi,{l-U)X2) {S = R+). 

This arises [25] in the context of the probabilistic worst-case analysis of 
Hoare's FIND algorithm. Theorem 32 implies existence of a fixed point with 
all moments finite, unique amongst possible fixed points with finite (1 + e)th 
moments. In a different way it can be proved [25] that any fixed point has 
all moments finite, and hence the fixed point is unique. 

Example 37. Consider the RDE 

(43) X = ?7 + cmax(Xi,X2) 
where < c < 1 . 

This arises [13] as a "discounted branching random walk." One interpre- 
tation is as nonhomogeneous percolation on the planted binary tree (the 
root has degree 1), where an edge at depth d has traversal time distributed 
at c'^T]. Then X is the time for the entire tree to be traversed. Assuming r] 
has all moments finite. Theorem 32 implies existence of a fixed point with 
all moments finite, unique amongst possible fixed points with finite expecta- 
tion. The same conclusion can be drawn in the slightly more general setup 
of a Galton- Watson branching tree with offspring distribution N. Instead of 
assuming rj has all moments finite, make the weaker assumption that there 
exists 9 > such that sup^-gj^ x^P{r] > x) < oo and mc^ < 1 where m = K[N]. 
Under these assumptions, [13] shows that the RDE (43) has a solution such 
that P{X > x) = o{x~") where a = — login/ logo. Moreover, this solution is 
unique in the class of distributions H such that x°(l — H{x)) — as a; — > oo. 
But outside this class there may be other solutions. 
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Example 38. Consider the RDE 



(44) 



X = 7] + max e 

i>i 




(5 = M+) 



where (^i,i > 1) are the points of a Poisson rate 1 process on (0,oo) and 
where rj has Exponential(l) distribution independent of (^j). 

This is a new example, arising from a species competition model [29]. Time 
reversal of the process in [29], together with a transformation of (0,1) to 
(0, oo) by X — > — log(l — x), yields a branching Markov process taking values 
in the space of countable subsets of (0, oo), which can then be extended to 
(— oo, oo) as follows. 

Each individual at position x at time t lives for an independent Exponential(e^) 
lifetime, after which it dies and instantaneously gives birth to an infinite 
number of children to be placed at positions {x + (,i)i>i where {£,i)i>i are 
points of an independent Poisson rate 1 process on (0, oo). The result of 
[29], transformed as above, shows that for each A < oo the Poisson rate A 
process on (— oo, oo) is a stationary law for this branching Markov process. 
We pose a different question. What is the extinction time X for the process 
started at time with a single particle at position 0? It is easy to see that 
X satisfies the RDE (44). 

For this example, easy calculation shows that c(p) = 1/p for p>l, so 
Theorem 32 implies existence of an invariant distribution with all moments 
finite which is also endogenous. 

Now in the setting of Corollary 33 consider the homogeneous equation, 
that is, with 77 = 



The solution X = oi (45) corresponds to the solution of (44) with all 
moments finite. We show below by direct calculation that there are other 
solutions of (45), which by Corollary 33 correspond to other solutions of (44) 
with infinite mean. 

Proposition 39. The set {Xa, a > 0) of all solutions of the RDE (45) 
is given by 



(45) 



X= maxe ^'Xi 



on S = R+. 



i>l 



(46) 




ifx<0, 



ifx>0. 



In particular for a = it is the solution 6q . 
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Proof. Let fihe a solution of (45). Notice that the points {(^j;Xj)|i > 
1} form a Poisson point process, say ^P, on (0, oo)^ with mean intensity 
dt fi{dx). Thus if F{x) = P{X < x), then for x > 

F(x) = PI maxe~'''X,' < x 

= P{no points of *p are in {(i,z)|e~*z > x}) 

^'^'^^ =exp^— J J dtfi{dx) 

e~*z>x 

We note that F is infinitely differentiable so by differentiating (47) we get 

(48) dF^F(.W-F(.)) 

dx X 

It is easy to solve (48) to verify that the set of all solutions is given by (46). 
□ 



Later results [Proposition 48(a) applied after taking logarithms] imply 
that for a > the invariant RTP associated with Xa is not endogenous. 



Example 40. Nonhomogeneous percolation on the binary tree. 



One can also consider the analog of (35) when max is replaced by min, 
though this situation does not seem to have been studied generally. One 
particular occurrence is in the setting of Example 37, interpreted as non- 
homogeneous percolation, in which case the time X taken to percolate to 
infinity satisfies the RDE 

(49) X = r] + cm.m{Xi,X2). 

This setting has been studied from a different viewpoint in [17]. 



4.6. Matchings on Galton-Watson trees. Amongst many possible exam- 
ples involving Galton-Watson trees, the following rather subtle example 
provides a warm-up to the harder example in Section 7.3. 

Consider an a.s. finite Galton-Watson tree T with offspring distribution 
N. Fix an arbitrary probability distribution v on (0, oo). Attach independent 
i^-distributed weights to the edges. A partial matching on T is a subset of 
edges such that no vertex is in more than one edge. The weight of a partial 
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matching is the sum of its edge weights. So associated with the random tree 
T is a random variable 

W := maximum weight of a partial matching. 

In seeking to study W via recursive methods, we quickly realize that a more 
tractable quantity to study is 

X : = maximum weight of a partial matching 

(50) — maximum weight of a partial matching 

which does not include the root. 

To see why, fix a child i of the root. Compare (a) the maximum- weight 
partial matching Mi which matches the root to i with (b) the maximum- 
weight partial matching A4- in which the root is not matched. 

These matchings must agree on the subtrees of all first-generation children 
except i. On the subtree rooted at i. Mi is the maximum- weight partial 
matching which does not include i, and is the maximum- weight partial 
matching. Thus weight(A^j) — weight(7V4_) = S^i — Xi where S^i is the weight 
on edge (root,i) and Xi is defined as X but in terms of the subtree rooted 
at i. Since in seeking the maximum- weight partial matching we can use any 
2, or no z, we deduce the RDE 

(51) X=ma-K{0,^i-Xi,l<i<N) 

where the Xi are independent copies of X and the are i.i.d. (ly). Uniqueness 
and endogeny follow from Lemma 14. 

This RDE, in the special case where has Poisson(l) distribution, arose 
in the context of the problem 

study the maximum weight Wn of a partial matching on a uniform 
random n-vertex tree, in the n — > cxd limit. 

The essential idea is that a randomly chosen edge of that tree splits it into 
two subtrees, the smaller of which is distributed as a Galton- Watson tree 
with Poisson(l) off'spring. For the detailed story see Section 3 of [11], whose 
final result is: 

Theorem 41. Suppose v is nonatomic and has finite mean. Then 
limn-W„ = Eei(5>x+z) 

where the r.v.^s on the right are independent, ^ has distribution v, X is 
distributed as the solution of the RDE (51) with Poisson(l) distributed N 
and Z is distributed as the solution of the RDE 

Z = max(X, ^ — Z) 
where the r.v.^s on the right are independent. 
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Let us mention the explicit solutions of our RDE in two special cases. 
Lemma 42. Let N have Poisson(l) distribution. 

(a) ([11], Lemma 2) If u is the exponential 1) distribution, then the so- 
lution of the RDE (51) is 

P{X <x)=exp{-ce-''), x>0, 

where c ^ 0.715 is the unique strictly positive solution of + e~'^ = 1. 

(b) If v is the Bern(j>) distribution, then the solution of the RDE (51) is 
the Bern(l — x{p)) distribution, where x = x{p) solves x = e~^^. 

A closely related "dual" problem concerns independent sets. Recall that 
an independent set in a graph is a subset of vertices, no two of which 
are linked by an edge. Take as before a Galton-Watson tree with off- 
spring, and a probability distribution u on (0, oo). Now assign independent 
i/-distributed random weights to each vertex and consider 

X := maximum weight of an independent set 

— maximum weight of an independentset which does not include the root. 

Similar to above, we can argue that X is the solution of the RDE 

(52) X= max^O,C-^Ai^ 

where has distribution v, independent of N . 

5. Rightmost position of BRW. Here we work in the setting of Section 
4.2. We have a BRW on M, where an individual has a random number N 
of offspring, whose random displacements from the parent's position are 
oo > > .^2 ^ ■ ■ ■ ) distributed arbitrarily subject to the moment condition 
(24). For simplicity suppose > 1 a.s. and P{N > 1) > 0. By Proposition 
23, the position of the rightmost individual in generation n satisfies 

n^^Rn — > 7 a.s. 

For reasons to be explained in the next section, one expects that under mi- 
nor extra assumptions (including a nonlattice assumption) a much stronger 
result is true: there exist constants 7^ such that 

(53) Rn — ^n—^ X as n — > cxD 

and that X is the unique (up to translation) solution of the RDE 

(54) X= -7 + max(^i + Xi), -00 < A < 00. 

i 

This is our first example of an RDE which is "not simple," in the sense that 
we do not know how to construct probabilistically a solution. 
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5.1. Tightness of Rn- At first sight it may be surprising that a limit 
(53) could hold, since it presupposes that the sequence {Rn — median(i2,„)) 
is tight, whereas one might expect its spread to increase to infinity. However, 
tightness is quite easy to understand. 

Lemma 43. // 

(55) (median(i?„+i) — median(i?„), n > 0) is hounded above, 
then 

(56) [Rn — median(i2„), n > 1) is tight. 

Harry Kesten (personal communication) attributes this type of argument 
to old work of Hammersley: it is perhaps implicit in [36], page 662. 

Proof of Lemma 43. Given e > we can choose k <oo and B > —oo 
such that 

P (generation k has at least log2 1/e individuals in [i?, oo)) > 1 — e. 

Then by conditioning on the positions of generation k, 

P{Rn+k <B + median(i2,„)) < 2e. 

Writing A for an upper bound in (55), we deduce 

P{Rn+k <B - Ak + median(i?„+fc)) < 2e. 

This establishes the tightness requirement for the left tail of Rn- For the 
right tail, given e < 1/4 we can choose k < oo and B > — oo such that 

^(generation k has at least e""*^ log 1/e individuals in [B, oo)) >1 — e 

(we changed the log2l/e term above to e""*^ log 1/e). Write g„ for the 1 — 
e quantile of so that P{Rn > Qn) > Again by conditioning on the 
positions of the generation /c, 

P{Rn+k <B + q.n)<e + {l- ef' <2e< 1/2. 

So median(i?„+fc) > B + qn, implying 

Qn < median(i?„4.fc) — B < median(i?„) + Ak — B. 

Since Qn is the 1 — e quantile of Rn, this establishes the tightness requirement 
for the right tail of Rn- □ 

In the case where all displacements are nonpositive (i.e., by reflection 
the case where displacements are nonnegative and we are studying the po- 
sition of the leftmost particle) it is clear that Rn is decreasing and so (55) 
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holds automatically, and then the lemma implies (56). A slicker argument 
for this case is in [24], Proposition 2. The same holds (by translation) if 
there is a constant upper bound on displacement, that is, if 

< ^^o) = 1 for some constant xq < oo. 

From these tightness results it is natural to expect that, under rather weak 
regularity conditions, we in fact have the convergence results 

(57) median(i?„_|_i) — median(i2„) ^ 7, 

(58) Rn — median(ii„) -i X, 

for some limit distribution X. Our interest in these limits arises, of course, 
because if (57) and (58) hold, then the limit X must satisfy the RDE (54). 

5.2. Limit theorems. This topic has been studied carefully only in the 
IBRW setting. We quote a recent result from [14], which provides an exten- 
sive bibliography of earlier work. The proof uses a mixture of analytic and 
probabilistic tools, for example, the "stretching" partial order (which goes 
back to the original KPP paper [46]), and multiplicative martingales. 

Theorem 44 ([14]). Consider an IBRW where E[N] <oo, N>1, P{N > 
1) > 0, and where the offspring displacement has density f{x) = e~'^^^^ for 
some convex function n. Then the limit 

Rn — median (i?„) -^X 

exists [and hence satisfies the RDE (54)]. //E[A^logiV] < oo and if a tech- 
nical assumption on (t>{0) :=E[A^]/ e^^f{x)dx [details omitted) holds, then 
the limit distribution is of the form 

(59) P{X <x)= E[exp(- exp(0o(>" + x)))] 
for some constant 9q and random variable Y . 

While the log-concave assumption plays a key role in the proof, it does 
not seem intuitively to be essential for the result. 

Open Problem 45. Under what weaker hypotheses does Theorem 44 
remain true? 

5.3. Endogeny. The viewpoint of this survey is to seek to study existence 
and uniqueness of solutions of RDEs separately from weak convergence ques- 
tions. This has not been done very systematically in the present context: 
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Open Problem 46. Study existence and uniqueness of solutions to (54) 
directly from its definition. 

However, Proposition 48 will show that the associated RDE is generally 
not endogenous. 

We first need to exclude a degenerate case. Write 

7* = ess sup ^1. 

If 7* < oo and = 7*} > 1, then there exist embedded infinite Galton- 

Watson trees on which the parent -child displacement equals 7*; it easily 
follows that there is the a.s. limit 

Rn — nj* X a.s. 

and that the associated invariant RTP is endogenous. The next lemma 
(whose easy proof is omitted) excludes this case. 

Lemma 47. Consider a BRW satisfying (24) and P{N > 1) = 1. // 
(60) 7* = 00; or 7*<oo and E#{i|^i = 7*} < 1, 

then n~^Rn — > 7 < 7*. 

Proposition 48. Suppose X is a solution of the RDE (54). Under 
either assumption (a) or assumption (b) below, the invariant RTP associated 
with this solution is not endogenous. 

(a) There exist constants c,a > such that P{X > x) ~ cexp(— ax) as 
X 00. 

(b) Suppose there is a BRW satisfying (24), (60) with P{N > 1) = 1 and 
P[N > 1) > 0. Suppose there exist constants 7„ such that 

Rn-ln-^ X, 7„ - 7„_1 7, 

SO that necessarily 7 = lim„n~^i?„ and X satisfies the RDE (54). 

Open Problem 49. Weaken the assumptions in Proposition 48. In par- 
ticular, does nonendogeny hold under only the assumptions of Lemma 47? 

From the viewpoint of the underlying BRW, nonendogeny is a property 
of the RTP associated with an n ^ cxj limit, so it is not obvious what its 
significance for the BRW might be. Informally, the argument in Section 5.4 
shows that nonendogeny is related to a kind of "nonpredictability" property 
of Rn- Given the ordered positions (Xn^i) of the nth-generation individuals, 
for iV > n write {pn,N{Xn,i), « > 1) for the probability that the rightmost 
individual in generation is a descendant of the Xn^i individual. Then there 
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exist limits Pn,oo{Xn,i) = ^^^N~>oo Pn.NiXn,i) ■ For fixed n this is maximized 
at the rightmost individual Xn.i, but it can be shown under suitable condi- 
tions that Pn,oo{Xn,i) — >■ as n ^ CX3. Loosely, it is unpredictable which line 
of descent leads to the rightmost individual at large times. 

5.4. Proof of Proposition 48. Using the notation of Corollary 26 let 
{Qi,i € T) be the associated BRW; that is, T is the family tree of the 
progenitor, and Qi is the position on M of the ith individual, with Q0 = 0. 
Fix d>l and let {Z^'^^ |gen(i) = d} be i.i.d. copies of X. For i € T define 

• Y^^^ = Z^'^\ when gen(i) = d; 

• yS'^'^ = max{Qj — Qi + 2j'^^|gen(j) = d and j is a descendant of i}, when 
gen(i) G {d- l,d-2,...,l,0}. 

It is easy to check that {Y^'^'^) defines an invariant RTF of depth d, for 
the RDE (54). 

Let Qd be the u-field generated by the first d generations of the BRW. So 
Qd] G, the cj-field generated by all the ^i's. Observe that 

(61) yi'') = max{Qj + Z^''^ |gen( j) = d}. 

Case (a). Under the conditional distribution given Q^, the random variable 
Y^'^^ has the same form as in Lemma 27, with the role of the (xi) being played 
by the ^^-measurable random variables ((5j,gen(j) = d), and the role of the 

(Zj) being played by the i.i.d. random variables (zj , gen( j) = d).So Lemma 
27 along with our assumption (a) of exponential right tail, implies that there 
exists 6 > such that 

(62) conc(dist(yi'^^|^rf),5) <l-6. 

This inequality is true for any invariant RTF of depth at least d, so in 
particular true for the invariant RTF (1^) of infinite depth, so we get 

conc(dist(Y0|a<i),(5) <l-5 =^ max P{a <Y0 < a + SlGd) < I - S. 

— oo<a<oo 

Now suppose that the invariant RTF were endogenous, that is, I0 is Q- 
measurable. Using the martingale convergence theorem we get for each ra- 
tional a 

l(a<y0<a+5) < 1 a.s. 

which is clearly impossible. 

For case (b) we need two lemmas. The first is straightforward (proof 
omitted) and the second is an analog of Lemma 27. 
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Lemma 50. Let po < 1. For each n let (C„.i, i > 1) be independent events 
with P{Cn,i) <Po- Suppose there is a random variable M* taking values in 
Z"*" = {0, 1,2,...; oo} such that 

lc„ i M* as oo 

i 

in the sense of convergence in distribution on 7L^ . Then either P{M* = 0) > 
or P(M* =oo) = 1. 

Lemma 51. Let (Zi) be i.i.d. nonconstant. For each n let {an/i,i > 1) be 
real constants. For k>\ let Mn^k be the kth largest of {an,i + Zi, i > 1). If 
Mn,i —^0 as n ^ oo, then for each k we have Mn^k ^0 as oo. 

Proof. Write 6* = ess sup Zi. Arrange {an,i,i > 1) in decreasing order. 
Since a„^i + Zi is asymptotically not greater than it is easy to see that 
limsup^a^,! < —6*. From nonconstancy of Zi it follows that for all e > 
there exist po <1 and no < oo such that 

P(a„,i + Zi > -e) <po, n>no. 

Apply Lemma 50 to the events {o„^i + Zi> —e}, passing to a subsequence 
to assume existence of a limit 

i 

By assumption P{M* = 0) = 0, so by Lemma 50 P(M* = oo) = 1, implying 
M„,fc^O. 

Case (b). Recall the argument leading to (61). Take (Z^^^) to be further 
i.i.d. copies of X and set 

fi'') = max{Qj + Zf ) |gen( j) = d}. 

Then the joint distribution (yj''^ y^*^^) is the distribution r(^)"(;^ (g) /i) ap- 
pearing in Theorem 11(c), and that theorem asserted that endogeny is equiv- 
alent to 

(63) {Y^''^ , yi'') )^{X,X) as d ^ oo. 

Suppose, to obtain a contradiction, that (63) were true. Writing for the 
Qd-measurahle r.v. defined as the median of the conditional distribution of 
Y^'^^ given Q^, (63) would easily imply 
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Now for k>l consider 

Bd,k = kth largest of {Qj + zj'^^|gen(j) = d}. 

So Bd^i — Ad^O. Now apply Lemma 51 conditionally on Q^, with the role 
of the (a„^j) being played by (Qj — Ad), to conclude that for each k we have 
Bd,k — ^ 0. (More pedantically, we need to detour through a subsequence 
argument to justify conditional application of Lemma 51; we omit details.) 
So 

(64) Bd,i-Bd,k^O. 

Next we exploit the underlying BRW. Write 

Rm,k = position of kth rightmost individual in generation m. 

Fix d and u > 0. For an individual j in generation d, the displacement of 
its rightmost descendant in generation m is asymptotically (m — s- oo) dis- 
tributed as X, independently as j varies, and so 

liminf ^(iim 1 - Rmk<u)> P{Bd I- Bdk<u) 

by considering the rightmost descendant of each of the k generation-d indi- 
viduals featuring in the definition of Bd^k- Now (64) implies 

P 

Rm,i — Rm,k ~^ as m ^ oo. 

But this property (for each k) states that an ever-increasing number of 
individuals accumulate near the rightmost individual, easily implying 

Rm+1,1 - Rm,i - ess sup ^ 0. 

This in turn implies linim = ess sup ^i, contradicting Lemma 47. 

□ 

Remark 1. Some multiplicative martingales used in the study of BRW 
(see, e.g., [20]) are of the form 

i 

where {Y^^i > 1) are the positions of the generation-n individuals. The a.s. 
limit Z{9) = \imnZn{0) satisfies the RDE 

i 

This is an instance of an "average-case" RDE paralleling the "extreme-case" 
RDE (54). 
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Remark 2. Very recently Iksanov has shown (personal communication) 
that one can derive existence of solutions to (54) by considering a related 
linear RDE. Interestingly, all those solutions have exponential right tail and 
hence by Proposition 48 none are endogenous. 

6. Frozen percolation process on infinite binary tree. A different setting 
where a particular "max-type" RDE plays the crucial role is the frozen 
percolation process on the infinite binary tree, studied in [5]. Let T3 = {V,£) 
be the infinite binary tree, where each vertex has degree 3; V is the set of 
vertices and £ is the set of undirected edges. Let {Ue)ee£ be independent edge 
weights with Uniform(0, 1) distribution. Consider a collection of random 
subsets At (^£ for < t < 1, whose evolution is described informally by: 

, .Ao is empty; for each e £ £, at time t = Ue set At = At- U {e} if each 
end- vertex of e is in a finite cluster of At-', otherwise set At = At-. 

(A cluster is formally a connected component of edges, but we also consider 
it as the induced set of vertices.) For comparison purposes, a more familiar 
process is Bt := {e £ £\Ue < t}, for < t < 1; which gives the standard per- 
colation process on T3 [35]. It is elementary that the clusters of Bt can be 
described in terms of the Galton- Watson branching process and that infi- 
nite clusters exist for t > | but not for i < |. The evolution of the process 
(Bt, <t <1) can be described informally by: 

for each e G £, at time t = Ue set Bt = Bt- U {e}. 

We notice that any process satisfying (*) must have At = Bt for t < ^ but 
At Q Bt ioT t > ^. Qualitatively, in the process (At) the clusters may grow 
to infinite size but, at the instant of becoming infinite, they are "frozen" in 
the sense that no extra edge may be connected to an infinite cluster. The 
final set Ai will be a forest on T3 with both infinite and finite clusters, such 
that no two finite clusters are separated by a single edge. 

Following [5] we call this process the frozen percolation process on T3. 
Although this process is intuitively quite natural, rigorously speaking it is 
not clear that it exists or that (*) does specify a unique process. In fact 
Itai Benjamini and Oded Schramm (personal communication) have an ar- 
gument that such a process does not exist on the Z^-lattice with its natural 
invariance property. But for the infinite binary tree case [5] gives a rigorous 
construction of a process satisfying (*), which can be summarized as follows 
(Theorem 1 and Proposition 2 of [5]). 

Theorem 52. There exists a joint law for {At, < t < 1) and {Ue, eG £) 
such that (*) holds and the joint law is invariant under the automorphisms 
of T3. Furthermore for a prescribed edge e and vertex v of T3, and fixed t 
in (^,1), the following are true: 
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(a) P{cluster containing e becomes infinite in [t,t-\-dt]) = -^dt, 

(b) P {cluster containing v becomes infinite in [t, t + dt\) = ^ dt, 

(c) P[e in some infinite cluster of Ai) = P{e in some finite cluster of Ai 
±,P{e^Ai) = '^, 

(d) P{v in some infinite cluster of Ai) = |, P{v in some finite cluster of Ai] 
^,P{v^Ai) = ^. 



6.1. 540° arguments. The phrase circular argument has negative con- 
notations, but we win describe what we term a 540° (i.e., one and a half 
circles) argument. In summary, the three half-circles are: 

• Suppose a process with desired qualitative properties exists. Do heuristic 
calculations leading to an RDE. 

• Solve the RDE. Use the associated RTF to make a rigorous construction 
of a process. 

• Repeat original calculations rigorously. 

In the next three sections we outline how this argument is used to prove 
Theorem 52. A similar 540° argument in a more sophisticated setting un- 
derlies the mean- field minimal matching example of Section 7.3. 

6.2. Stage 1. Suppose that the frozen percolation process exists on T3 
and has the natural invariance and independenc^properties. Define a mod- 
ified tree called the planted binary tree, written T3 = (V,<S), where one dis- 
tinguished vertex (we call it the root) has degree 1 and the other vertices 
have degree 3. Write e for the edge at the root. Clearly T3 is isomorphic 
to the subtree of T3 which can be obtained by first making some vertex 
the "root" and then removing two edges coming out of the root and their 
induced subtrees. Given independent Uniform(0, 1) variables, say (Ue)^^^, 

as the edge weights on T3, we suppose we can define a frozen percolation 
process on this modified tree. Let Y be the time at which the component 
containing the edge e becomes infinite, with y = 00 if never. Let ei and 62 
be the two edges which are coming out of the edge e; write the corresponding 
induced planted subtrees as ¥3^ and T^^. Let Yi and Y2 be the respective 
times for the edges ei and 62 to join an infinite cluster in ¥3^ or T32. Now 
consider T3. If Ug < min(yi,l2)5 then the edge e joins an infinite component; 
otherwise it never enters the process. Thus one can write 

(65) Y = ^mm{Yi,Y2),Ue), 

where $ : / x [0, 1] ^ /, with / = [i, 1] U {00} is defined as 

fx, if x > M, 



(66) $(x,u) 



00, if X < n. 
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Observe that the subtrees TsjjTsj are isomorphic to T3, and so Yi and Y2 
are independent and distributed as Y . And so the law of Y on the set / 
satisfies the RDE 

(67) y = $(min(yi,y2),c/), 

where U = Uniform(0, 1) and Yi,Y2 are i.i.d. and have the same law as Y. 
Fortunately this RDE is easy to solve. 

Lemma 53 ([5]). A probability law fx on I satisfies the RDE (67) if and 
only if for some € 1] 

(68) ii{dx) = ^dx, ^<x<xo; n{oo) = ^. 

Proof. Suppose that a probability law on / is a solution of (67) with 
distribution function F. Then from the definition of ^ 

F{x) = P{U <mm{Yi,Y2)<x), ^ < x < 1. 

Supposing F has a density F' on [^,1] (which can be proved by a more 
careful rephrasing of the argument); we get 

F'{x) = 2x{l- F{x))F'{x), l<x<l, 

and hence it follows that 

(69) F{x) = 1 - on [i, 1] n support(//). 

Since the function ^ ^ ^ — is strictly increasing, identity (69) can only 
happen when support(/x) = [^,Xo] for some \ < xq <1. 

Conversely it is easy to see that such a probability law on / satisfies the 
RDE (67). □ 

From the definition of Y in terms of frozen percolation on the planted 
binary tree, we expect the support of its distribution to be all of / = 1] U 
{cx)}, and so we choose the particular solution (68) with xq = 1, that is, the 
distribution u defined by 

(70) u{dy) = ^dy, ^ < y < 1, i^(oo) = ^, 
or equivalently 

(71) z.((y,oo]) = ^, \<y<^- 

Continuing to argue heuristically, we can now do the calculations recorded 
in Theorem 52: we give the argument for (a), and the other parts are similar. 
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Write 61,62,63,64 for the edges adjacent to the edge 6, and , T32 , T33 , T34 
for the corresponding planted binary trees which are ah isomorphic to T3, 
and further let Yi,Y2,Y3,Y4^ be the times at which the respective edges enter 
an infinite cluster of the frozen percolation processes restricted to the sub- 
trees. Writing Z for the time taken for the edge e to enter an infinite cluster 
(note Z = 00 if never), then 



(72) Z ■ 



min li, if C/g < min 

l<j<4 l<j<4 



^ 00, otherwise. 
Thus the density fz of Z on [h, 1] in terms of the law u olY can be written 



as 



Jz\x) = x X 4— X u {{x, 00)) = 4x X X I — I = — -. 



dx ' " 2x2 ^2x7 4x4' 

as asserted in part (a). 



6.3. Stage 2. We now start a rigorous construction based on knowing 
that the distribution u at (70) solves the RDE (67). For each edge e S <5 let 
6 and 6 be the two directed edges defined by e, and let £ be the set of all 
directed edges. Now for the directed edges we have a natural language of 
family relationship: the edge e = {v,w) has two children of the form {w,xi) 
and {w,X2)- It is not hard to use the Kolmogorov consistency theorem and 
the fact that z/ solves (67) to show 

Lemma 54 ([5]). There exists a joint law for {{Ug,Yg),e £ £) which is 
invariant under automorphisms 0/T3 and such that for each e£ £ 

Yg has law v, 

(73) 

Yg = $ (min(y^^ , F^J , Ug) a. s. , 
where e\ and 62 are children of e, and where for each 6 € Ug=Ue = f^e- 



Now we can outline the rigorous construction of the frozen percolation 
process. Essentially, one takes the heuristically obvious property (72) as a 
definition. In more detail, for an undirected edge e, define de as the set of 
four directed edges adjacent to e and directed away from it. Define 

(74) Ai := {e G £\Ue < mm{Ye> : e G de)}. 

Finally for < t < 1 define 



(75) 



At := {e G Ai\Ue < t}. 
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It is now clear that (At) inherits from (Yg) the automorphism-invariance 
property, as weh as the property that the only possible time at which an 
edge e can join the process is at time Ue- To check that (At) actually satisfies 
(*) and so it is a frozen percolation process is somewhat more elaborate, and 
this part of the proof [5] of existence is omitted here. 

6.4. Stage 3. We can now repeat the argument leading to (72) in terms 
of the explicit construction above and its modification on planted binary 
trees. This leads to part (a) as shown in Stage 1, and the other parts are 
similar. 

6.5. The endogenous property. If the RDE were nonendogenous, then 
the frozen percolation process would have a kind of "spatial chaos" prop- 
erty, that the behavior near the root was affected by the behavior at infinity. 
For several years we conjectured in seminar talks that the RDE is nonen- 
dogenous, but recently proved the opposite. 

Theorem 55 ([16]). The invariant RTF associated with (67), (70) is 
endogenous. 

7. Combinatorial optimization within the mean-field model of distance. 

In problems involving n random points in d-dimensional space, explicit cal- 
culations for n ^ oo asymptotics are often complicated by the obvious fact 
that the (g) inter-point distances are dependent r.v.'s. One can make a less 
realistic but more tractable model by eliminating the ambient d-dimensional 
space and instead assuming that the (2) inter-point distances are indepen- 
dent r.v.'s. This is the mean-field model of distance. Specifically, assume 
inter-point distances have exponential distribution with mean n, so nearest- 
neighbor distances are order 1. This model mimics true inter-point distances 
in d = 1 dimension; other distributions can be used to mimic other d without 
changing essential aspects of what follows. 

This model, and study of the minimal spanning tree and minimal match- 
ing problems within it, are surveyed in some detail in Sections 4 and 5 of 
[11]. Here we emphasize a different example, in Section 7.2, and only briefiy 
record the RDEs arising in the minimal matching, traveling salesman and 
variant problems (Sections 7.3-7.6). 

7.1. The PWIT approximation. In the mean-field model above, the key 
feature is that there is an n ^ cxo "local weak limit" structure called the 
PWIT {Poisson weighted infinite tree), which describes the geometry of the 
space as seen from a fixed reference point. In brief (see [11] for more details) 
consider a Poisson point process 



(76) 



< 6 < 6 < 6 < 
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of rate 1 on (0,oo). Take a root vertex 0. Let this root have an infinite 
number of children 1,2,3, ... , the edge-lengths to these children being dis- 
tributed as the Poisson process > 1) at (76). Repeat recursively; each 
vertex i has an infinite number of children (ij, j > 1) and the edge-lengths 
Ciji J ^ 1 are distributed as the Poisson process (76), independent of other 
such Poisson processes. See Figure 4. 

7.2. Critical point for minimal subtrees. Consider the mean-field model 
on n points as the complete graph Kn on n vertices, and write for the 
length of edge e. For a subtree t, that is, a tree whose vertices are some 
subset of the n vertices, write 

|t| = number of edges in t, 

L(t) = ^ = total length of t, 

est 

a(t) = L(t)/|t| = average edge-length of t. 

A well-known result [32] on minimal spanning trees says that, if we insist 
on |t| = n — 1, then the smallest we can make a(t) is about ((3) := 
If we fix < e < 1 and consider subtrees with around en edges, then we 
guess that the smallest value of a(t) should be around S{e) as n —f oo, 
for some deterministic function 6(e). It is not hard to show that 5{e) > 
for large e while 5{e) = for small e. So there must be a critical point at 
which 6{-) leaves 0; this is analogous to critical points in percolation theory. 
What is interesting is that the critical point is determined by an RDE. It 
is convenient to turn the problem around and study the maximum size of 
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a subtree t subject to the constraint that a(t) < c. For fixed < c < cx) 
consider the RDE on 5 = [0, oo) 

oo 

Y= ^(c — ^j + li)"'"; (^i) a Poisson rate 1 point process on (0,00). 

(77) 

Proposition 56 ([4]). Define M{n,c) =max{|t| :t a subtree of Kn,a{t) < 
c}. Then there exists a critical point c(0) € [e~^,e~^] such that 

(78) ifc<c{0) then n'^M{n,c) -^0, 

(79) ifo c(0) then 3 t]{c) > such that P{n~'^M{n, c) > r]{c)) 1, 
and 

c(0) =va.i{c\RDE (77) /las no solution on [0,oo)}. 

The conceptual point to emphasize is that, by analogy with Example 2, we 
are studying an "average" by studying whether an associated "compensated 
sum" is finite or infinite. 

First we explain how the RDE (77) arises. In the PWIT define, for integers 
/i>0, 

= sup{c|t| -L(t)|root Gt,height(t) < h], 

where the sup is over subtrees t of the PWIT, and where height (t) denotes 
the maximum number of edges in a path in t from the root. To obtain 
the maximizing t one simply considers in turn each child i of the root and 
considers whether one gets a positive contribution by including child i in t. 
The contribution equals 

where is the length of edge from root to child z, and Y^^ is a sup over 
subtrees tj of child i: 

y/''-^) = sup{c|ti| - L(ti) I height (ti) <h-l}. 

So 

oo 

(80) yW=^(c-e, + y/"-^')+ 

1=1 

where the {Y^^ are, by the recursive structure of the PWIT, independent 
copies of y('^-i). Writing for the map on distributions associated with 
the RDE (77), the last equality says 

T,'^(<5o) = dist(y(^)). 
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Lemma 15 then implies that for c < c(0), 

T^{6o) He as /i ^ oo, 

where /Xc, supported on [0,00), is the lower invariant measure; and that the 
RTP is endogenous. Indeed, /ic is just the distribution of 

=sup{c|t| - L(t)|root G t, t finite}. 

Outline proof of Proposition 56. Fix c < c(0). Roughly, the fact 
that y(°°) is finite implies that there cannot exist large subtrees of the 
PWIT with average edge-length much greater than c; the fact that the PWIT 
represents the local structure of Kn for large n implies that the same should 
hold for Kn', this is the lower bound (78) of the proposition. Let us amplify 
this argument into four steps. Fix an integer m. 

Step 1. The connection between Kn (the complete graph on n vertices 
with random edge-lengths) and the PWIT is provided by local weak conver- 
gence of the former to the latter — see [11], Theorem 3 for formalization. A 
soft consequence of local weak convergence is 

limdistmaxjcltl — L(t)|t C ir„,root € t, Itl < 3m| 

(81) n L I I Wl , I I _ J 

is stochastically smaller than T^^5q. 

Indeed, we would have asymptotic equality if we required only that t have 
depth < 3m; but we make a stronger restriction. 

Step 2. The quantity above can be used to bound the chance of the 
event: there exists a small tree t containing the root and with c|t| — L{t) > x. 
Consider the mean number of vertices v satisfying that event (with v in place 
of root) and apply Markov's inequality to deduce the following: 

P{3 at least 6n vertices v s.t. 3t3v with c|t| — L{t) > x and |t| < 3m) 
is asymptotically < fic[x,oo]. 

Step 3. Any tree with at least 3m edges can be split into edge-disjoint 
subtrees, each having between m and 3m edges. 

Step 4. We are assuming c < c(0), so interpose c< c' < c* < c(0). If the 
desired conclusion (78) were false, there would be some "big" tree t with |t| > 
en and a(t) < c. Use step 3 to split into "small" subtrees; a deterministic 
averaging argument shows there would exist at least dn vertices v in small 
trees t' with a(t') < c' . Here 6 depends on £,c,c' but not on n,m. These 
trees now satisfy c*|t| — L(t) > m{c* — c'). Applying step 2, the chance of 
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this many small trees is at most fic*[fn{c* — c'),oo]. Since m is arbitrary 
and /ic*(oo) = we get the lower bound (78). We are done. 

The argument for the upper bound (79) is parallel. For c > c(0) each v 
has nonvanishing chance of being in some large finite tree t with a(t) > e(c), 
and one can patch together these trees to get an r2(nO-size tree t with 
a(t))>e(c). □ 

Incidentally, the lower bound c(0) > stated in Proposition 56 arises 
from the first moment method, and the upper bound c(0) <e~^ comes from 
considering paths as a special case of trees. Moreover, numerically c(0) ~ 
0.263. See [4] for details. 

7.3. Minimal matching. Consider again the mean-field model of distance, 
that is, the complete graph Kn with random edge- lengths with exponential 
(mean n) distribution. Take n even and consider a (complete) matching, 
that is, a collection of n/2 vertex-disjoint edges. Define 

Mn = minimum total length of a complete matching. 

This problem is often studied in the bipartite case ([61], Chapter 4) but 
the two versions turn out to be equivalent in our asymptotic setting. The 
following limit behavior was argued nonrigorously in [53] and proved (in the 
bipartite setting) in [1, 6]. There are fascinating recent proofs [48, 55] of an 
underlying exact formula for EM„ in the bipartite, exponential distribution 
setting, but it seems unlikely that the applicability of exact methods extends 
far into the broad realm of problems amenable to asymptotic study. 

Theorem 57. f EM„ tt^/G. 

The technically difficult proof is outlined in moderate detail in Section 5 
of [11]. Here we emphasize only the underlying RDE, and some analogous 
RDEs arising in analogous problems. 

The central idea is that, since the PWIT originates as a "local weak 
limit" of Kn, one can relate matchings on Kn to matchings on the PWIT. 
The technically hard, though noncomputational, part of the proof is to show 
that the limit lim„ ^EMn must equal 

i^^p(typical edge-length of Ai) : M an invariant matching on the PWIT}. 

Here invariant means, intuitively, that in defining the matching M on the 
PWIT, the root must play no special role. Now one can see how to con- 
struct the optimal matching A^opt on the PWIT by reusing two ideas we 
have seen earlier in this survey. First, we use the 540° argument from Section 
6.1: start with heuristically defined quantities, obtain an RDE and use its 
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solution as a basis for rigorous construction. Second, we use the idea from 
Section 4.6 of seeking a recursion for a quantity defined as a difference. 
Write T for the PWIT. Consider the definition, analogous to (50), 

, X X0 = length of optimal matching on T 

— length of optimal matching on T \ {0}. 

Here we mean total length, so we get 00 — cx), and so this makes no sense 
rigorously. But pretend it does make sense. Then for each child j of the root 
we can define Xj similarly in terms of the subtree T-' rooted at j: 

Xj = length of optimal matching on T-' 

— length of optimal matching on \ {j}- 

One can now argue, analogously to (51), 

(84) X^= mm {^j - X,), 

l<j<oo 

(85) )ot is matched to the vertex argminj(,^j — Xj) in the optimal matching. 

Recall (^i) is the Poisson process (76). This motivates us to consider the 
RDE 

(86) X= mm (Ci-Xi) {S = R). 

1< j<oo 

Luckily, this turns out to be easy to solve. 

Lemma 58 ([6], Lemma 5). The unique solution of (86) is the logistic 
distribution 

(87) P{X <X) = 1/{1 -CX)<X<CXD, 

or equivalently the density function 

f(x) = {e^/^ + 6-'-'/^)-^ -oo<x<oo. 

Implementing the 540° argument, we will use the logistic solution or the 
RDE to construct a random matching on the PWIT. Each edge e in the 
edge-set E of T corresponds to two directed edges e, e : write E for the 
set of directed edges and write ^(e) =C(e) for the edge-length. For 

a directed edge {v,w) we can call the directed edges {{w,x)\ x v} its 
children. The Kolmogorov consistency theorem and the logistic solution of 
(86) imply (cf. Lemma 54) 

Lemma 59. Jointly with the edge-lengths (^(e),e S E) of the PWIT we 
can construct {X(e),eGE} such that: 

(i) each X{e) has the logistic distribution, 
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(ii) for each e, with children 6*1,62,... say, 

X(6-)= mill (^(e,)-X(e-,)). 

l<_;<oo 

Theorem 61 will show that X(v,v') depends only on the edge- lengths 
within the subtree rooted at v' . Guided by the heuristic (85), for each vertex 
V define 



V* = argmin(^(i),f') — X(v,v')). 

v' 

In view of (82) an outline proof of Theorem 57 can be completed by proving 

Proposition 60 ([6], Lemma 16, Propositions 17 and 18). (a) The set 
of edges {v,v*) forms a matching A^opt on the PWIT. 

Write Mopti^) for the vertex to which the root is matched in A^opt; 
so that the mean edge-length in Alopt can be written as E,^(0, A4opt(0)). 
Then: 

(b) E^(0,A^opt(0))=vrV6. 

(c) For any invariant matching M, 

(90) E^{0,M{0))-EC{0,Mopt{0))>O. 

Let us indicate only the proofs of (a) and (b). For (a) we need only show 
that (v*)* =v. Note first 

^{v, V*) — X{v, V*) < min(,^(t;, y) — X{v, y)) by definition of v* 
= X{v*,v) by recursion (88) 

or equivalently 

(91) ^{v,v*)<X{v,v*)+X{v*,v). 
And if z y^v* is another neighbor of v, then 

^{v, z) - X{v, z) > min(^(z;, y) - X{v, y)) 

= X{z,v) 

or equivalently 

i{v,z) > X{v,z)+X{z,v). 

We conclude that v* is the unique neighbor of v satisfying (91). But the 
right-hand side of (91) is symmetric, so applying this conclusion to v* shows 
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To prove (b) we calculate the mean length of the edge at the root. In order 
for this length to be x, there must be an edge of length x from the root to 
some vertex j, and also (91) we must have x < X(ioot,j) + X(j, root). But 
these are distributed as independent logistics, say Xi and X2, and so 

(92) Ee(0,Xopt(0))= / xdx P{x<Xi + X2) 

Jo 

= ^E,{{Xi + ^2)"^)^ by a general formula 
= jE{Xi + X2f by symmetry 

= \¥.Xl 

(93) = vrVe, 

the last step using a standard fact that the logistic distribution has variance 
vrVS. 

Theorem 61 ([15]). The invariant RTF associated with the RDE (86) 
is endogenous. 

The significance of this result is pointed out in Section 7.5. The proof 
involves somewhat intricate analytic study of the iterates T(^)" (/i x ^u) to 
verify Theorem 11(c). We remark that we have not succeeded in using con- 
traction methods to prove Theorem 61. Indeed the operator T associated 
with the RDE (86) is not a strict contraction. To see this, it is easy to check 
that T is well defined on the subspace Vi of distributions with finite mean. 
Moreover, if X has logistic distribution, then the distribution of c + X is a 
fixed point for for any c € M. Hence T cannot be a strict contraction on 
the whole of Vi . This shows that the logistic solution of (86) does not have 
full "domain of attraction," but the question of determining the domain of 
attraction remains open. 

Open Problem 62. Find the subset V cVi such that T^{i^)-^fi as 
n ^ 00 if and only if z/ G D, where is the logistic distribution. 

Remark. The way we started the heuristic argument at (83) may seem 
implausible, because one might expect analogous quantities in finite-re set- 
ting to have spreads which increase to infinity with re. But a better analogy 
is with the position i?„ of the rightmost point in BRW; its spread (Lemma 
43) stays bounded with re. 
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7.4. TSP and other matching problems. Here we follow Sections 6.1 and 
6.2 of [6], which showed how earlier work [51, 52] fits into the current frame- 
work. 

As suggested at the start of Section 7, one can define a mean-field model 
of distance with any real pseudo- dimension < d < cx) to mimic distances 
between random points in d dimensions. Precisely, take the complete graph 
Kn on n vertices, and let the i.i.d. edge-lengths have distribution n}/'^L 
where 

P{L<x)r^x'^/d asxiO. 

This scaling keeps nearest-neighbor distances as order 1. And in the local 
approximation of this Kn by the PWIT, we simply change the distribution 
of edge-lengths at a vertex in the PWIT: the homogeneous Poisson process 
of rate 1 is replaced by an inhomogeneous Poisson process < < ^2 < • • • 
of rate x"^"^; in other words 

E(number of i with ii<x) = x'^ jd. 

For minimum matching in pseudo-dimension it is remarkable that 
(heuristically, at least) the previous analysis is in principle unchanged. The- 
orem 57 becomes [cf. (92)] 

2 

lim -EM„ = / x'^PiXx + > x) dx 
" n Jo 

where Xi and X2 are independent copies of the solution of the RDE 

(94) X= min(^i-Xi) (5 = M). 

l<i<oo 

Thus the abstract structure of the limit theorem is unchanged in pseudo- 
dimension d. But for d^l there is no known explicit solution of (94); and 
while numerical methods indicate that there is indeed a unique solution, 
rigorous proof remains elusive. 

Open Problem 63. Prove that for each real < d < 00 there is a unique 
solution to the RDE (94), and that the associated invariant RTP is endoge- 
nous. 

Similarly, in the TSP (traveling salesman problem) in pseudo-dimension d, 
a variant of the argument leading to recursion (86) leads us to the recursion 

(95) X= min[2](^i-Xi) {S = R). 

l<i<oo 

Here mint^l denotes the second minimum, and the analog of Theorem 57 is 
that the length Sn of the optimal TSP satisfies 

1 

lim-E5„= / x'^P(Xi+X2>x)dx 
" n Jo 
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where Xi and X2 are independent copies of the solution of the RDE (95). 
NumericaUy the hmit is about 2.04 for d = 1. Again numerical methods 
indicate that there is a unique solution for all d, but no rigorous proof is 
known even for d=l. 

Open Problem 64. Prove that for each real < d < 00 (or at least for 
d = l) there is a unique solution to the RDE (95), and that the associated 
invariant RTP is endogenous. 

Instead of studying minimal matchings one could study Gibbs distribu- 
tions on matchings; this leads to a different RDE ([6], (46), and [62]) 

00 

i=l 

which is somewhat in the spirit of the linear case. 

7.5. The cavity method. The nonrigorous cavity method was developed 
in statistical physics in the 1980s; see [54] for a recent survey. Though typi- 
cally applied to examples such as ground states of disordered Ising models, 
it can also be applied to the kind of "mean-field combinatorial optimization" 
examples of the last two sections. It turns out that the methodology used 
in [6] to make a rigorous proof of the mean-field matching limit serves to 
provide a general methodology for seeking rigorous proofs paralleling the 
cavity method in a variety of contexts. This is a broad and somewhat com- 
plicated topic, and the time is not ripe for a definitive survey, but it seems 
worthwhile to outline the ingredients of the rigorous methodology, pointing 
out where RDEs and endogeny arise. 

Start with a combinatorial optimization problem over some size-n random 
structure. 

• Formulate a "size-00" random structure, the ?i — > cxo limit in the sense of 
local weak convergence. 

• Formulate a corresponding combinatorial optimization problem on the 
size-00 structure. 

• Heuristically define relevant quantities on the size-00 structure via addi- 
tive renormalization [cf. (83)]. 

• If the size-00 structure is treelike (the only case where one expects ex- 
act asymptotic solutions), observe that the relevant quantities satisfy a 
problem-dependent RDE. 

• Solve the RDE. Use the unique solution to find the value of the optimiza- 
tion problem on the size-00 structure. 

• Show that the RTP associated with the solution is endogenous. 
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• Endogeny shows that the optimal solution is a measurable function of the 
data, in the infinite-size problem. Since a measurable function is almost 
continuous, we can pull back to define almost-feasible solutions of the 
size-n problem with almost the same cost. 

• Show that in the size-n problem one can patch an almost-feasible solution 
into a feasible solution for asymptotically negligible cost. 

7.6. Scaling laws in mean-field combinatorial optimization. Here we in- 
dicate current nonrigorous work on scaling exponents associated with mean 
field of combinatorial optimization problems. As indicated in the method- 
ology description above, the main requirement for making rigorous proofs 
would be proofs of uniqueness and endogeny for the RDEs which arise. So 
our discussion emphasizes the RDEs. 

7.6.1. Near-optimal solutions. In the context of mean-field minimum 

(n) 

matching, compare the optimal matching Mopt with a near-optimal match- 
ing A^^"^ by using the two quantities 

= n-^E[number edges of X^") \ TW^^], 
= n^iE[cost(X(")) - cost{M^^)]. 

Then define 

e:^(5) =minK(X("))|5„(AlW) > J}. 

We anticipate a limit e{6) = lim„e"((5), and then can ask whether there is a 
scaling exponent 

e{6) X 5° as 5 ^ 0. 

Such a scaling exponent provides a measure of how different an almost- 
optimal solution can be from the optimal solution. 

Remarkably, it is not so hard to study this question by an extension of 
the methods of Section 7.3. It turns out [10] that the key is the extension of 
the RDE (86) to the following RDE on 5 = R^: 

(96) [y|= mm{i,-{Z, + \)l{i = i*)-Yil{i^i*)) 



where 



: argmin(^i - Xj) 



and where A > is a Lagrange multiplier. In terms of the solution of this 
RDE one can define functions e(A) and (5(A) which then define the limit 
function £{5). Numerical study in [10] indicates the scaling exponent a = 3 
in both minimal matching and TSP problems in the mean-field model. 
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7.6.2. TSP percolation function. In the context of the TSP in the mean- 
field model of distance, one can study a function {p{u),0 <u<l) analogous 
to the percolation function, defined as follows. Recall Kn is the complete 
graph with random edge-lengths. Over all cycles Trn,u in Kn containing un 
vertices, let Cn,u be the minimum average edge- length of 7r„^„. We anticipate 
a limit 

\imECn.u = p{u). 

n 

It turns out [8] that what is relevant is the following RDE for a distribution 
{X,Z) on m?: 

max(A-e, + X,-Z+) \ 

i 

max(A -ii+Xi- Zf) + max^l (A - + - Zf) ^ ' 

i i / 

Here A > is again a Lagrange multiplier. In terms of the solution of this 
RDE one can define functions p(X) and u{X) which then define the limit 
function p{u). Numerical study in [8] indicates a scaling exponent p{u) x 
as M I with a = 3. Moreover, for both the present "percolation function" 
setting and the previous "near-optimal solution" setting, one can pose anal- 
ogous questions involving trees in place of tours, and it turns out [8, 10] that 
for both these questions the scaling exponent is 2. But at present we have 
no good conceptual explanation of these fascinating observations. 




7.6.3. First passage percolation. A somewhat different setup is appropri- 
ate for a mean-field model of first passage percolation. Take the 4-regular tree 
T, with in-degree 2 and out-degree 2 at each vertex; regard this as the mean- 
field analog of the oriented lattice 7?. Attach independent exponential(l) 
random variables to the edges of T. We study flows f = (/(e)) on T, 
for which the in-flow equals the out-flow at each vertex, with < /(e) < 1. 
Associated with an invariant random flow are two numbers 

9(f ) = ]E/(e): the average density of the flow, 

E/(e)C(e) 

T(f ) = — — ' flow- weighted average edge-traversal time. 

We study the function 

5*(t) := sup{9(f ) : f an invariant flow with r(f ) = r}, < r < 1. 

We have 6*{t) > iff r > rppp, where rppp is the time constant in first 
passage percolation on T. As above, to study scaling exponents the key is a 
certain RDE for S = , which turns out [7] to be 
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(98) 



Z = mill + 6 - «, -^3 + 6 - a, Yli^i + a) j 
- min 0, ^ (Zi + - o), ^ {Zi + - a) j . 

\ i=l,2 i=l,3 / 



Here a is a parameter € (tfpp, 1). In terms of the solution of this RDE one 
can define functions 6{a) and r(a) which then determine the function 5*{t). 
Numerical study in [7] indicates a scaling exponent 2: 

(5*(r) ~ 12.7(r — tfpp)^ asrlrppp. 
8. Complements. 

8.1. Numerical and Monte Carlo methods. In the context of studying a 
fixed point equation T{iJ,) = fj, or the bivariate analog in Theorem 11, there 
are several numerical methods one might try: solving the equation directly or 
calculating iterates T"(/Uo) for some convenient ^oi discretization or working 
in a basis expansion. But implementation is highly problem-dependent. 

In contrast, given an RDE X = g{^, Xi, i > 1) the bootstrap Monte Carlo 
method provides a very easy to implement and essentially problem-independent 
method. Start with a list of N numbers (take N = 10,000 say) with empirical 
distribution fiQ. Regard these as "generation-0" individuals {Xf, 1 <i < N). 
Then T{^q) can be approximated as the empirical distribution /ii of N 
"generation- 1" individuals {Xl, 1 < i < A^), each obtained independently 
via the following procedure. Take ^ with the prescribed distribution, take 
Ii,l2, ■ ■ ■ independent uniform on {1,2, . . . , N} and set 

Xl=g{^,Xl,Xl,...). 

Repeating for, say, 20 generations lets one see whether T"'(/io) settles down 
to a solution of the RDE. Note that as well as finding solutions of a given 
RDE, this method can be used to study endogeny via Theorem 11(c). 



8.2. Smoothness of densities. For linear RDEs, under appropriate as- 
sumptions one can show that fixed points are unique and have densities, 
and use this as a basis for a theoretical "exact sampling" scheme; see [26]. 
In the Quicksort example (21), smoothness of densities has been studied in 
[31]. It would be interesting to seek general smoothness results for solutions 
of max- type RDE. 
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8.3. Dependence on parameters. When an RDE depends on a parameter 
(e.g., our (77) and (96); see also examples involving multiplicative martin- 
gales for branching processes, e.g., [23], Theorem 3.3), it is natural to ask 
whether the solution depends continuously on the parameter. This has ap- 
parently not been studied in any generality. 

8.4. Continuous-time analogs. We set up recursive tree processes as discrete- 
generation processes, analogous to discrete-time Markov chains. Let us men- 
tion two contexts where continuous-parameter analogs of RTFs arise. The 
first is the classical KPP equation, which is the analog of (54) for branching 
Brownian motion; see [37] for a recent probabilistic account. The second con- 
cerns the maximum X of standard Brownian excursion of length 1 . By scal- 
ing, the maximum Xt for Brownian excursion of length t satisfies Xt = t^^'^X. 
Since excursions above higher levels are independent (conditional on their 
lengths), we can write (for infinitesimal 5) 

X = 6 + maxty^{6) Xi 

i 

where {ti{5),i > 1) are the lengths of excursions above level 5 within stan- 
dard Brownian excursion. See [18] for this kind of decomposition. 

8.5. Process-valued analogs. There are examples where the distribution 
arising in an RDE is the distribution of a stochastic process, rather than a 
single real-valued random variable. Here is an illustration. 

Birth and assassination process [9]. Start with one individual at time 
0. During each individual's lifetime, children are born at the times of a 
Poisson (rate A) process. An individual cannot die before the time of its 
parent's death (time 0, for the original individual); after that time, the 
individual lives for a further random time S, i.i.d. over individuals. Consider 
the random time X at which the process becomes extinct. It is not hard to 
show [9] that X < oo a.s. under the assumption infu>oti~^Eexp(u5) < 1/A. 

It does not seem possible to find an equation for X itself, but one can 
study a process {X{t),0 <t< oo) for which X = X{0). Specifically, first set 
up the process of all possible descendants; for a realization, for each t let 
X{t) be time until extinction, in the modification where the first individual 
has a "fictional parent" who dies at time t. One can now argue that the 
process {X{t)) satisfies the RDE 

X{t) = t + S+ max {^i + Xi{t + S -C^)) 

where (^j) are the points of a Poisson rate A process on (0,oo). This RDE 
has not been studied. 
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Open Problem 65. Study the scaling behavior of X in the limit as 
1/A I infu ti~^Eexp(nS'). 

8.6. Matchings in random regular graphs. Fix r > 2 and consider a ran- 
dom r-regular n-vertex graph Q{n,r). Attach independent exponential 1) 
distributed random weights {(,e) to edges. As in Section 4.6, let M{n,r) be 
the maximum weight of a partial matching in Q(n,r). The n — oo limit 
of Q{n,r), in the sense of local weak convergence, is the infinite r-regular 
tree T^. Thus one can seek to carry out the general program formalizing 
the cavity method (Section 7.5) in this setting. Recent work [33] provides 
interesting positive and negative results. The relevant RDE is [cf. (51)] 

(99) X= max (0,^/-^^) (5 = M+). 

— 1 

Theorem 66 ([33]). Let Tr—i be the map associated with the RDE (99). 
Then T^_i has a unique invariant distribution. Moreover, for (Xj) with the 
invariant distribution, 

lim?i~"'^EM(n, r) 

n 

= \^YA^t (i, -X,= max (e, -X,)> 

= / te-\l-e-\l-b)Y-Ut 

2 Jo 

I r(r-l)(l-6) 

2 

roo rt 

X / / te-^e-^{l-e-'{l-b)f~^{l-e-^+^{l-b)Y~^dzdt 
Jo Jo 

where b is the unique solution of b= \ — ^{i-h) ■ 

Similar results for matchings on the sparse random graph model are also 
derived in [33] . But in the "dual" problem for independent subsets the rele- 
vant RDE turns out to have nonunique solution for r > 5, and nonuniqueness 
holds also for independent sets in the sparse random graph model above a 
certain critical value. So this setting provides an important test bed for the 
range of applicability of the method. 

8.7. Random fractal graphs. A recent thesis [42] studies RDEs arising in 
the context of constructing random fractal graphs, and discusses examples 
such as the following: 

X^g{X^,X2,i) (5 = M+) 
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where 

^ = Bern(p), 

g{xi,X2,0) = 2mm(a;i,a;2), 

5(2:1, 2:2, 1) = imax(xi,X2). 

However, the form of functions g are chosen for mathematical convenience, 
rather than being derived from an underlying richer model as in our exam- 
ples. 

8.8. List of open problems and conjectures. These can be fitted into four 
categories. 

Weakening hypotheses in general theorems. 
Open Problem 12: bivariate uniqueness implies endogeny. 
Open Problem 19: solving linear RDE on M. 
Open Problem 18: endogeny for linear RDE on R"*". 
Open Problem 31: finiteness of discounted tree-sums. 
Open Problem 45: convergence of BRW extremes. 
Open Problem 49: nonendogeny of extremes in BRW. 

Existence and uniqueness of solution of particular RDEs. Here one can 
also ask about endogeny. 

Open Problem 46: extremes of branching random walk. 
Open Problem 63: mean-field matching, d^\. 
Open Problem 64: mean-field TSP. 
Ah three RDEs in Section 7.6. 

Endogeny and nonendogeny. 
Conjecture 34: discounted tree-sums. 

Open Problem 62: domain of attraction for minimum matching. 

Scaling exponents. 
Open Problem 30: range of BRW and speed of algorithmic BRW. 
Open Problem 65: birth and assassination. 
Ah three RDEs in Section 7.6. 

9. Conclusion. Here we attempt to review the big picture. 

1. RDEs in general, and max-type RDEs in particular, arise in the study 
of a wide range of underlying stochastic models. Look again at the list of 
models in Table 1. 

2. While for linear RDEs one has hope of general theory, the diversity of 
forms of the function g{-) listed in Table 1 makes it hard to envisage a gen- 
eral theory which encompasses many max-type examples in one axiomatic 
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framework. Indeed it is not clear how to make any useful classification of our 
examples; we have given only an informal "simple/not simple" classification 
(start of Sections 4 and 5) based on whether there is a relatively easy a.s. 
construction of random variables satisfying the RDE. 

3. The cavity method (Section 7.5) provides a range of examples new 
to the probability community. Existence and uniqueness of solutions has 
been proved rigorously only in the special settings of mean-field matching 
in pseudo-dimension 1 (Section 7.3) and matchings and independent sets in 
random graphs (Section 8.6). It remains a challenge to establish existence 
and uniqueness in the other examples of Sections 7.4 and 7.6. 

4. What is new in this survey is the discussion of endogeny, both the 
(fairly straightforward) general theory in Section 2, and the analysis of ex- 
amples. In some cases one can study endogeny in a model parameterized by 
a distribution ^ (e.g., Corollaries 17 and 26; Proposition 48) but in other 
cases (Theorems 21, 55 and 61) the argument relies on analytic calculations 
based on knowing an explicit formula for the invariant distribution for a 
specific ^. For making the cavity method rigorous, one would like techniques 
to establish endogeny without knowing such explicit formulas. 
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